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Lyme Disease Vaccines 

5 

Field of the Invention 

The present invention relates to novel vaccines for the prevention or attenuation of Lyme 
disease. The invention further relates to isolated nucleic acid molecules encoding antigenic 
10 polypeptides of Borrelia burgdorferi. Antigenic polypeptides are also provided, as are vectors, 
host cells and recombinant methods for producing the same. The invention additionally relates to 
diagnostic methods for detecting Borrelia gene expression. 

15 Background of the Invention 

Lyme disease (Steere, A.C., Proc. Natl. Acad. Sci. USA 97:2378-2383 (1991)), or Lyme 
borreliosis, is presently the most common human disease in the United States transmitted by an 
arthropod vector (Center for Disease Control, Morbid. Mortal. Weekly Rep. 46(23 ):53 1-535 

20 (1997)). Further, infection of house-hold pets, such as dogs, is a considerable problem. 

While initial symptoms often include a rash at the infection point, Lyme disease is a 
multisystemic disorder that may include arthritic, carditic, and neurological manifestations. While 
antibiotics are currently used to treat active cases of Lyme disease, B. burgdorferi persists even 
after prolonged antibiotic treatment. Further, B. burgdorferi can persist for years in a mammalian 

25 host in the presence of an active immune response (Straubinger, R. et aL, J. Clin. Microbiol. 
35:111-116 (1997); Steere, A., N. Engl. J. Med. 527:586-596 (1989)). 

Lyme disease is caused by the related tick-borne spirochetes classified as Borrelia 
burgdorferi sensu lato (including B. burgdorferi sensu stricto, B. qfzelii, B. garinii). Although 
substantial progress has been made in the biochemical, ultrastructural, and genetic characterization 

30 of the organism, the spirochetal factors responsible for infectivity, immune evasion and disease 
pathogenesis remain largely obscure. 

A number of antigenic B. burgdorferi cell surface proteins have been identified. These 
include the outer membrane surface proteins (Osp) OspA, OspB, OspC and OspD. OspA and 
OspB are encoded by tightly linked tandem genes which are transcribed as a single transcriptional 

35 unit (Brusca, J. et aL / Bacteriol. 773:8004-8008 (1991)). The most-studied B. burgdorferi 
membrane protein is OspA, a lipoprotein antigen expressed by borreliae in resting ticks and the 
most abundant protein expressed in vitro by most borrelial isolates (Barbour, A.G., et al. y 
Infection & Immunity 47:795-804 (1983); Howe, T.R., et al. t Science 227:645 (1985)). 
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A number of different types of Lyme disease vaccines have been shown to induce 
immunological responses. Whole-cell B. burgdorferi vaccines, for example, have been shown to 
induce both immunological responses and protective immunity in several animal models 
(Reviewed in Wormser, G., Clin. Infect. Dis. 27:1267-1274 (1995)). Further, passive immunity 

5 has been demonstrated in both humans and other animals using B. burgdorferi specific antisera. 

While whole-cell Lyme disease vaccines confer protective immunity in animal models, use 
of such vaccines presents the risk that responsive antibodies will produce an autoimmune 
response (Reviewed in Wormser, G., supra). This problem is at least partly the result of the 
production of B. burgdorferi specific antibodies which cross-react with hepatocytes and both 

10 muscle and nerve cells. B. burgdorferi heat shock proteins and the 41-kd flagellin subunit are 
believed to contain antigens which elicit production of these cross-reactive antibodies. 

Single protein subunit vaccines for Lyme disease have also been tested. The cell surface 
proteins of 5. burgdorferi are potential candidates for use in such vaccines and several have been 
shown to elicit protective immune responses in mammals (Probert, W. et al., Vaccine 75:15-19 

15 (1997); Fikrig, E. et aL, Infect Immun. 63:1658-1662 (1995); Langerman S. et aL, Nature 

372:552-556 (1994); Fikrig, E. et aL, J. Immunol. 148:2256-2260 (1992)). Experimental OspA 
vaccines, for example, have demonstrated efficacy in several animal models (Fikrig, E., et aL, 
Proc. Natl Acad Sci. USA 89:5418-5421 (1992); Johnson, B.J., et aL, Vaccine 73:1086-1094 
(1996); Fikrig, E., et al f Infect. Immun. 60:657-661 (1992); Chang, Y.F., et aL, Infection & 

20 Immunity (53:3543-3549 (1995)), and OspA vaccines for human use are under clinical evaluation 
(Keller, D., et aL, J. Am. Med Assoc. 277:1764-1768 (1994); Van Hoecke, C, et aL, Vaccine 
74:1620-1626 (1996)). Passive immunity is also conferred by antisera containing antibodies 
specific for the full-length OspA protein. Further, vaccination with plasmid DNA encoding OspA 
has been demonstrated to elicit protective immune responses in mice (Luke, C. et aL, J. Infect. 

25 Dis. 775:91-97 (1997); Zhong, W. et aL, Eur. J. Immunol. 26:2749-2757 (1996)). 

Recent immunofluorescence assay observations indicate that during tick engorgement the 
expression of OspA by borreliae diminishes (deSilva, A.M., et aL, J. Exp. Med 753:271-275 
(1996)) while expression of other proteins, exemplified by OspC, increases (Schwan, T.G., et 
aL, Proc. Natl. Acad Sci. USA 92:2909-2913 (1985)). By the time of transmission to hosts, 

30 spirochetes in the tick salivary glands express little or no OspA. This down-modulation of OspA 
appears to explain the difficulties in demonstrating immune responses to this antigen early in 
infection following tick bites (Kalish, R.A., et aL, Infect. Immun. 63:2228-2235 (1995); Gem, 
L., etal., J. Infect. Dis. 767:971-975 (1993); Schiable, U.E., etal. f Immunol. Lett. 36:219-226 
(1993)) or following challenge with limiting doses of cultured borreliae (Schiable, U.E., et aL, 

35 Immunol. Lett. 36:219-226 (1993); Barthold, S.W. and Bockenstedt, L.K., Infect. Immun. 
67:4696-4702 (1993)). 

Furthermore, OspA-specific antibodies are ineffective if administered after a borrelial 
challenge delivered by syringe (Schiable, U.E., et al., Proc. Natl. Acad ScL USA 87:3768-3772 
(1990)) or tick bite (deSilva, A.M., et aL, J. Exp. Med 783:271-275 (1996)). To be efficacious, 
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OspA vaccines must elicit protective levels of antibody which are maintained throughout periods 
of tick exposure in order to block borrelia transmission from the arthropod vector. 

Vaccines in current use against other pathogens include in vrw-expressed antigens which 
could boost anamnestic responses upon infection, potentiate the action of immune effector cells 

5 and complement, and inhibit key virulence mechanisms. OspC is both expressed during infection 
(Montgomery, R.R., et aL, J. Exp. Med. 785:261-269 (1996)) and a target for protective 
immunity (Gilmore, R.D., etal, Infect. Immun. 64:2234-2239 (1996); Probert, W.S. and 
LeFebvre, R.B., Infect Immun. 62:1920-1926 (1994); Preac-Mursic, V., et aL, Infection 
20:342-349 (1992)), but mice immunized with this protein were only protected against challenge 

10 with the homologous borrelial isolate (Probert, W.S., et al., 7. Infect. Dis. 775:400405 (1997)). 
Identification of in viw-expressed, and broadly protective, antigens of B. burgdorferi has 
remained elusive. 

Summary of the Invention 

15 The present invention provides isolated nucleic acid molecules comprising polynucleotides 

encoding the B. burgdorferi peptides having the amino acid sequences shown in Table 1. Thus, 
one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides 
having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1 ; (b) a 

20 nucleotide sequence encoding any of the amino acid sequences of the full-length polypeptides 
shown in Table 1 but minus the N-terminal methionine residue, if present; (c) a nucleotide 
sequence encoding any of the amino acid sequences of the truncated polypeptides shown in Table 
1; and (d) a nucleotide sequence complementary to any of the nucleotide sequences in (a), (b), or 
(c) above. 

25 Further embodiments of the invention include isolated nucleic acid molecules that 

comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more 
preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in 
(a), (b), (c), or (d) above, or a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide in (a), (b), (c), or (d) above. This polynucleotide which hybridizes 

30 does not hybridize under stringent hybridization conditions to a polynucleotide having a 

nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid 
embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides 
which encode the amino acid sequences of epitope-bearing portions of a B. burgdorferi 
polypeptide having an amino acid sequence in (a), (b), or (c) above. 

35 The present invention also relates to recombinant vectors, which include the isolated 

nucleic acid molecules of the present invention, and to host cells containing the recombinant 
vectors, as well as to methods of making such vectors and host cells and for using these vectors 
for the production of B. burgdorferi polypeptides or peptides by recombinant techniques. 

The invention further provides isolated B. burgdorferi polypeptides having an amino acid 



WO 98/59071 



PCT/US98/12718 



sequence selected from the group consisting of: (a) an amino acid sequence of any of the full- 
length polypeptides shown in Table 1 ; (b) an amino acid sequence of any of the full-length 
polypeptides shown in Table 1 but minus the N-terminal methionine residue, if present; (c) an 
amino acid sequence of any of the truncated polypeptides shown in Table 1 ; and (d) an amino acid 
5 sequence of an epitope-bearing portion of any one of the polypeptides of (a), (b), or (c). 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% similarity to those described in (a), (b), (c), or (d) above, as well as 
polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% 
10 identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
those above; as well as isolated nucleic acid molecules encoding such polypeptides. 

The present invention further provides a vaccine, preferably a multi-component vaccine 
comprising one or more of the B. burgdorferi polypeptides shown in Table 1, or fragments 
thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the 
15 B. burgdorferi pplypeptide(s) are present in an amount effective to elicit an immune response to 
members of the Borrelia genus in an animal. The B. burgdorferi polypeptides of the present 
invention may further be combined with one or more immunogens of one or more other bomelial 
or non-borrelial organisms to produce a multi-component vaccine intended to elicit an 
immunological response against members of the Borrelia genus and, optionally, one or more non- 
20 borrelial organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" 
DNA, wherein the DNA encodes one or more borrelial polypeptides and, optionally, one or more 
polypeptides of a non-borrelial organism. The DNA encoding one or more polypeptides may be 
constructed such that these polypeptides are expressed fusion proteins. 
25 The vaccines of the present invention may also be administered as a component of a 

genetically engineered organism. Thus, a genetically engineered organism which expresses one 
or more B. burgdorferi polypeptides may be administered to an animal. For example, such a 
genetically engineered organism may contain one or more B. burgdorferi polypeptides of the 
present invention intracellulariy, on its cell surface, or in its periplasmic space. Further, such a 
30 genetically engineered organism may secrete one or more B. burgdorferi polypeptides. 

The vaccines of the present invention may be co-administered to an animal with an 
immune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response in an animal 
to one or more members of the Borrelia genus, e.g., B. burgdorferi sensu stricto, B. afzelii, and 
35 B. garinii, comprising administering to the animal a vaccine as described above. 

The invention further provides a method of inducing a protective immune response in an 
animal, sufficient to prevent or attenuate an infection by members of the Borrelia genus, 
comprising administering to the animal a composition comprising one or more of the polypeptides 
shown in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may 
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be conjugated to another immunogen and/or administered in admixture with an adjuvant 

The invention further relates to antibodies elicited in an animal by the administration of one 
or more B. burgdorferi polypeptides of the present invention. 

The invention also provides diagnostic methods for detecting the expression of genes of 
5 members of the Borrelia genus in an animal. One such method involves assaying for the 
expression of a gene encoding Borrelia peptides in a sample from an animal. This expression 
may be assayed either directly (e.g., by assaying polypeptide levels using antibodies elicited in 
response to amino acid sequences shown in Table 1) or indirectly (e.g., by assaying for 
antibodies having specificity for amino acid sequences shown in Table 1). An example of such a 
10 method involves the use of the polymerase chain reaction (PGR) to amplify and detect Borrelia 
nucleic acid sequences. 

The present invention also relates to nucleic acid probes having all or part of a nucleotide 
sequence shown in Table 1 which are capable of hybridizing under stringent conditions to 
Borrelia nucleic acids. The invention further relates to a method of detecting one or more Borrelia 
15 nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids 
encoding Borrelia polypeptides, comprising: 

a) contacting the sample with one or more of the above-described nucleic acid probes, 
under conditions such that hybridization occurs, and 

b) detecting hybridization of said one or more probes to the Borrelia nucleic acid present 
20 in the biological sample. 

Detailed Description 

The present invention relates to recombinant antigenic B. burgdorferi polypeptides and 
fragments thereof. The invention also relates to methods for using these polypeptides to produce 
25 immunological responses and to confer immunological protection to disease caused by members 
of the genus Borrelia. The invention further relates to nucleic acid sequences which encode 
antigenic B. burgdorferi polypeptides and to methods for detecting Borrelia nucleic acids and 
polypeptides in biological samples. The invention also relates to Borrelia specific antibodies and 
methods for detecting such antibodies produced in a host animal. 

30 

Definitions 

The following definitions are provided to clarify the subject matter which the inventors 
consider to be the present invention. 

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state 
35 or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, 
fungi, viruses and metazoan parasites which either produce a disease state or render an animal 
infected with such an organism susceptible to a disease state (e.g., a secondary infection). 
Further included are species and strains of the genus Borrelia which produce disease states in 
animals. 
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As used herein, the term "organism" means any living biological system, including 
viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Borrelia" means any species or strain of bacteria which is 
members of the genus Borrelia. Included within this definition are Borrelia burgdorferi sensu lato 
5 (including B. burgdorferi sensu stricto, B. afzelii, B. garinii), B. andersonii, B. anserina, B. 
japonica, B. coriaceae, and other members of the genus Borrelia regardless of whether they are 
known pathogenic agents. 

As used herein, the phrase "one or more B. burgdorferi polypeptides of the present 
invention" means the amino acid sequence of one or more of the B. burgdorferi polypeptides 
10 disclosed in Table 1. These polypeptides may be expressed as fusion proteins wherein the 
B. burgdorferi polypeptides of the present invention arc linked to additional amino acid 
sequences which may be of borrelial or non-bonelial origin. This phrase further includes 
fragments of the B. burgdorferi polypeptides of the present invention. 

As used herein, the phrase "full-length amino acid sequence" and "full-length polypeptide" 
15 refer to an amino acid sequence or polypeptide encoded by a full-length open reading frame 
(ORF). An ORF may be defined as a nucleotide sequence bounded by stop codons which 
encodes a putative polypeptide. An ORF may also be defined as a nucleotide sequence within a 
stop codon bounded sequence which contains an initiation codon (e.g., a methionine or valine 
codon) on the 5' end and a stop codon on the 3' end. 
20 As used herein, the phrase "truncated amino acid sequence" and "truncated polypeptide" 

refer to a sub-sequence of a full-length amino acid sequence or polypeptide. Several criteria may 
also be used to define the truncated amino acid sequence or polypeptide. For example, a truncated 
polypeptide may be defined as a mature polypeptide (e.g., a polypeptide which lacks a leader 
sequence). A truncated polypeptide may also be defined as an amino acid sequence which is a 
25 portion of a longer sequence that has been selected for ease of expression in a heterologous 

system but retains regions which render the polypeptide useful for use in vaccines (e.g., antigenic 
regions which are expected to elicit a protective immune response). 

Additional definitions are provided throughout the specification. 
Explanation of Table 1 
30 Table 1 lists B. burgdorferi nucleotide and amino acid sequences of the present invention. 

The nomenclature used therein is as follows: 
"nt" refers to nucleotide sequences; 
"aa" refers to amino acid sequences; 

"f" refers to full-length nucleotide or amino acid sequences; and 
35 "t" refers to truncated nucleotide or amino acid sequences. 

Thus, for example, the designation "flOl.aa" refers to the full-length amino acid sequence 
of B. burgdorferi polypeptide number 101. Further, "flOLnt" refers to the full-length nucleotide 
sequence encoding the full-length amino acid sequence of B. burgdorferi polypeptide number 
101. 
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Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between the 
polypeptides of the present invention and those available through GenBank and GeneSeq 

5 databases. These reference numbers are the database entry numbers commonly used by those of 
skill in the art, who will be familar with their denominations. The descriptions of the 
numenclature for GenBank are available from the National Center for Biotechnology Information. 
Column 1 lists the gene or ORF of the present invention. Column 2 lists the accession number of 
a "match" gene sequence in GenBank or GeneSeq databases. Column 3 lists the description of 

10 the "match" gene sequence. Columns 4 and 5 are the high score and smallest sum probability, 
respectively, calculated by BLAST. Polypeptides of the present invention that do not share 
significant identity/similarity with any polypeptide sequences of GenBank and GeneSeq are not 
represented in Table 2. Polypeptides of the present invention that share significant 
identity/similarity with more than one of the polypeptides of GenBank and GeneSeq are 

1 5 represented more than once. 

Explanation of Table 3- 

The B. burgdorferi polypeptides of the present invention may include one or more conservative 
amino acid substitutions from natural mutations or human manipulation as indicated in Table 3. 
20 Changes are preferably of a minor nature, such as conservative amino acid substitutions that do 
not significantly affect the folding or activity of the protein. Residues from the following groups, 
as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, 
Acidic, and Small, 

25 Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing fragments 
present in each of the full length B. burgdorferi polypeptides described in Table 1 as predicted by 
the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4: 181-186. 
The Jameson- Wolf antigenic analysis was performed using the computer program PROTEAN 

30 (Version 3. 1 1 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, 
WI). B. burgdorferi polypeptide shown in Table 1 may one or more antigenic epitopes 
comprising residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant may vary 
slightly. The residues and locations shown described in Table 4 correspond to the amino acid 

35 sequences for each full length gene sequence shown in Table 1 and in the Sequence Listing. 
Polypeptides of the present invention that do not have antigenic epitopes recognized by the 
Jameson- Wolf algorithm are not represented in Table 2. 
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Selection of Nucleic Acid Sequences Encoding Antigenic B. burgdorferi 
Polypeptides 

The present invention provides a select number of ORFs from those presented in the 
fragments of the Borrelia burgdorferi genome which may prove useful for the generation of a 
5 protective immune response. The sequenced B. burgdorferi genomic DN A was obtained from a 
sub-cultured isolate of ATCC Deposit No. 35210. The sub-cultured isolate was deposited on 
August 8, 1997 at the American Type Culture Collection, 12301 Park Lawn Drive, Rockville, 
Maryland 20852, and given accession number 202012. 

Some ORFs contained in the subset of fragments of the B. burgdorferi genome disclosed 
10 herein were derived through the use of a number of screening criteria detailed below. The ORFs 
are generally bounded at the amino terminus by a methionine residue and at the carboxy terminus 
by a stop codon. 

Many of the selected sequences do not consist of complete ORFs. Although a polypeptide 
representing a complete ORF may be the closest approximation of a protein native to an organism, 

15 it is not always preferred to express a complete ORF in a heterologous system. It may be 

challenging to express and purify a highly hydrophobic protein by common laboratory methods. 
Some of the polypeptide vaccine candidates described herein have been modified slightly to 
simplify the production of recombinant protein. For example, nucleotide sequences which encode 
highly hydrophobic domains, such as those found at the amino terminal signal sequence, have 

20 been excluded from some constructs used for in vitro expression of the polypeptides. 

Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus 
have also been excluded from the recombinant expression constructs. Thus, in one embodiment, 
a polypeptide which represents a truncated or modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially immunogenic 

25 polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all 

theoretical Borrelia burgdorferi ORFs for several aspects of potential immunogenicity. One set of 
selection criteria are as follows: 

1. Type I signal sequence: An amino terminal type I signal sequence generally directs a 
nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. 

30 Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I 
signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and 
Kendall, D. A. Mol Microbiol 73:765-773 (1994)). The length of the type I signal sequence is 
approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in 
the extreme amino terminus. In addition, the central region of the signal sequence adopts an 

35 alpha-helical conformation in a hydrophobic environment Finally, the region surrounding the 
actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and 
-3 positions. 

2. Type IV signal sequence: The type IV signal sequence is an example of the several 
types of functional signal sequences which exist in addition to the type I signal sequence detailed 
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above. Although functionally related, the type IV signal sequence possesses a unique set of 
biochemical and physical attributes (Strom, M. S. and Lory, S., J. Bacteriol. 774:7345-7351 
(1992)). These are typically six to eight amino acids with a net basic charge followed by an 
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal 
5 sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In 
addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative 
to the cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors 
has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly 

10 three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)- 
(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C, J. 
Bioenerg. Biomembr. 22:451-471 (1990)). 

4. LPXTG motif: It has been experimentally determined that most anchored proteins 
found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal 

15 sequence. More than fifty such proteins from organisms such as S. pyogenes* S. mutans, B. 
burgdorferi, S. pneumoniae, and others, have been identified based on their extracellular location 
and carboxy terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410 (1996)). 
The conserved region consists of six charged amino acids at the extreme carboxy terminus 
coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. 

20 Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in 
nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X 
is any amino acid. 

An algorithm for selecting antigenic and immunogenic Borrelia burgdorferi polypeptides 
including the foregoing criteria was developed. The algorithm is similar to that described in U.S. 
25 patent application 08/781,986, filed January 3, 1997, which is fully incorporated by reference 
herein. Use of the algorithm by the inventors to select immunologically useful Borrelia 
burgdorferi polypeptides resulted in the selection of a number of the disclosed ORFs. 
Polypeptides comprising the polypeptides identified in this group may be produced by techniques 
standard in the art and as further described herein. 

30 

Nucleic Acid Molecules 

The present invention provides isolated nucleic acid molecules comprising polynucleotides 
encoding the B. burgdorferi polypeptides having the amino acid sequences shown in Table 1, 
which were determined by sequencing the genome of B. burgdorferi deposited as ATCC deposit 
35 no. 202012 and selected as putative immunogens. 

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA 
molecule herein were determined using an automated DNA sequencer (such as the Model 373 
from Applied Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of DNA sequences determined as 
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above. Therefore, as is known in the art for any DNA sequence determined by this automated 
approach, any nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more typically at 
least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the 
5 sequenced DNA molecule. The actual sequence can be more precisely determined by other 

approaches including manual DNA sequencing methods well known in the art As is also known 
in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted 
amino acid sequence encoded by a determined nucleotide sequence will be completely different 

10 from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the 
point of such an insertion or deletion. 

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a 
sequence of deoxyribonucleotides (abbreviated A, G , C and T). However, by "nucleotide 
sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or 

15 polynucleotide, a sequence of deoxyribonucleotides, and for an RNA molecule or polynucleotide, 
the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine 
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the 
ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence of 
Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule 

20 having a sequence in which each deoxyribonucleotide A, G or C of Table 1 has been replaced by 
the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced 
by a ribonucleotide U. 

Nucleic acid molecules of the present invention may be in the form of RNA, such as 
mRNA, or in the form of DNA, including, for instance, cDN A and genomic DNA obtained by 

25 cloning or produced synthetically. The DNA may be double-stranded or single-stranded. 

Single-stranded DNA or RNA may be the coding strand, also known as the sense strand, or it 
may be the non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, 
which has been removed from its native environment For example, recombinant DNA molecules 

30 contained in a vector are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present 
invention. Isolated nucleic acid molecules according to the present invention further include such 

35 molecules produced synthetically. 

In addition, isolated nucleic acid molecules of the invention include DNA molecules which 
comprise a sequence substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode a B. burgdorferi polypeptides and peptides of the 
present invention (e.g. polypeptides of Table 1). That is, all possible DNA sequences that encode 
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the B. burgdorferi polypeptides of the present invention. This includes the genetic code and 
species-specific codon preferences known in the art. Thus, it would be routine for one skilled in 
the art to generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those preferred by a 
5 mammalian or other bacterial host such as E. coll). 

The invention further provides isolated nucleic acid molecules having the nucleotide 
sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of 
the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes 
for gene mapping and for identifying J8. burgdorferi in a biological sample, for instance, by PCR, 

10 Southern blot, Northern blot, or other form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding portions or 
fragments of the nucleotide sequences described herein. Fragments include portions of the 
nucleotide sequences of Table 1 at least 10 contiguous nucleotides in length selected from any two 
integers, one of which representing a 5' nucleotide position and a second of which representing a 

15 3' nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is 

position 1. That is, every combination of a 5 ! and 3' nucleotide position that a fragment at least 
10 contiguous nucleotides in length could occupy is included in the invention. "At least" means a 
fragment may be 10 contiguous nucleotide bases in length or any integer between 10 and the 
length of an entire nucleotide sequence of Table 1 minus 1 . Therefore, included in the invention 

20 are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide 

sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of 
an entire nucleotide sequence minus 1. 

Further, the invention includes polynucleotides comprising fragments specified by size, in 
nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in 

25 contiguous nucleotides, selected from integers between 10 and the length of an entire nucleotide 
sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 
nucleotides, 40 nucleotides, 50 nucleotides. Other preferred sizes of contiguous nucleotide 
fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 
nucleotides in length which include, as discussed above, fragment sizes representing each integer 

30 between 50-300. Laiger fragments are also useful according to the present invention 

corresponding to most, if not all, of the nucleotide sequences shown in Table lor of the B. 
burgdorferi nucleotide sequences of the plasimd clones listed in Table 1 . The preferred sizes are, 
of course, meant to exemplify not limit the present invention as all size fragments, representing 
any integer between 10 and the length of an entire nucleotide sequence minus 1 , are included in 

35 the invention. Additional preferred nucleic acid fragments of the present invention include 
nucleic acid molecules encoding epitope-bearing portions of B. burgdorferi polypeptides 
identified in Table 4. 

The present invention also provides for the exclusion of any fragment, specified by 5' and 
3* base positions or by size in nucleotide bases as described above for any nucleotide sequence of 
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Table 1 or the plasimd clones listed in Table 1. Any number of fragments of nucleotide sequences 
in Table 1 or the plasimd clones listed in Table I, specified by 5' and 3' base positions or by size 
in nucleotides, as described above, may be excluded from the present invention. 

Preferred nucleic acid fragments of the present invention also include nucleic acid 
5 molecules encoding epitope-bearing portions of the B. burgdorferi polypeptides shown in Table 
1 . Such nucleic acid fragments of the present invention include, for example, nucleic acid 
molecules encoding polypeptide fragments comprising from about the amino terminal residue to 
about the caiboxy terminal residue of each fragment shown in Table 4. The above referred to 
polypeptide fragments are antigenic regions of particular B. burgdorferi polypeptides shown in 

10 Table 1 . Methods for determining other such epitope-bearing portions for the remaining 

polypeptides described in Table 1 are well known in the art and are described in detail below. 

In another aspect, the invention provides isolated nucleic acid molecules comprising 
polynucleotides which hybridize under stringent hybridization conditions to a portion of a 
polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic 

15 acid sequence shown in Table 1. By "stringent hybridization conditions" is intended overnight 
incubation at 42 C in a solution comprising: 50% formamide, 5x SSC (150 mM NaCl, 15 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 
O.lxSSC at about 65 C 

20 By polynucleotides which hybridize to a "portion" of a polynucleotide is intended 

polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably about 30-70 nt of the reference polynucleotide. These are useful as diagnostic probes 
and primers as discussed above and in more detail below. 

25 Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, 

for instance, a portion 50-100 nt in length, or even to the entire length of the reference 
polynucleotide, are also useful as probes according to the present invention, as are 
polynucleotides corresponding to most, if not all, of a nucleotide sequence as shown in Table 1 . 
By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more 

30 contiguous nucleotides from the nucleotide sequence of the reference polynucleotide (e.g., a 
nucleotide sequences as shown in Table 1). As noted above, such portions are useful 
diagnostically either as probes according to conventional DNA hybridization techniques or as 
primers for amplification of a target sequence by PCR, as described, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., 

35 eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire 
disclosure of which is hereby incorporated herein by reference. 

Since nucleic acid sequences encoding the B. burgdorferi polypeptides of the present 
invention are provided in Table 1, generating polynucleotides which hybridize to portions of these 
sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides 
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of the present invention could be generated synthetically according to known techniques. 

As indicated, nucleic acid molecules of the present invention which encode B. burgdorferi 
polypeptides of the present invention may include, but are not limited to those encoding the amino 
acid sequences of the polypeptides by themselves; and additional coding sequences which code 
5 for additional amino acids, such as those which provide additional functionalities. Thus, the 
sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence 
encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which 
10 are commercially available. As described in Gentz etaL, Proc. Natl. Acad. ScL USA 56:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the resulting fusion 
protein. 

Thus, the present invention also includes genetic fusions wherein the 
B. burgdorferi nucleic acid sequences coding sequences provided in Table 1 are linked to 

15 additional nucleic acid sequences to produce fusion proteins. These fusion proteins may include 
epitopes of borrelial or non-borrclial origin designed to produce proteins having enhanced 
inununogenicity. Further, the fusion proteins of the present invention may contain antigenic 
determinants known to provide helper T-cell stimulation, peptides encoding sites for 
posMranslational modifications which enhance immunogenicity (e.g., acylation), peptides which 

20 facilitate purification (e.g., histidine "tag"), or amino acid sequences which target the fusion 

protein to a desired location (e.g., a heterologous leader sequence). For instance, hexa-histidine 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Natl. 
Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for purification which corresponds 
to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell 

25 37:767. As discussed below, other such fusion proteins include the B. burgdorferi polypeptides 
of the present invention fused to Fc at the N- or C-terminus. 

Post-translational modification of the full-length B. burgdorferi OspA protein 
expressed in E. coli is believed to increase the immunogenicity of this protein. Erdile, L. et aL, 
Infect Immuru 67:81-90 (1993). B. burgdorferi OspA when expressed in E. coli 9 for example, 

30 is post-translationally modified in at least two ways. First, a signal peptide is cleaved; second, 
lipid moieties are attached The presence of these lipid moieties is believed to confer enhanced 
immunogenicity and results in the elicitation of a strong protective immunological response. 

Variant and Mutant Polynucleotides 
35 The present invention thus includes nucleic acid molecules and sequences which encode 

fusion proteins comprising one or more 5. burgdorferi polypeptides of the present invention 
fused to an amino acid sequence which allows for post-translational modification to enhance 
immunogenicity. This post-translational modification may occur either in vitro or when the fusion 
protein is expressed in vivo in a host cell. An example of such a modification is the introduction 
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of an amino acid sequence which results in the attachment of a lipid moiety. Such a lipid moiety 
attachment site of OspA, which is lipidated upon expression in E coli, has been identified. 
Bouchon, B. et al t Anal Biochem. 246:52-61 (1997). 

Thus, as indicated above, the present invention includes genetic fusions wherein a 
5 B. burgdorferi nucleic acid sequence provided in Table 1 is linked to a nucleotide sequence 
encoding another amino acid sequence. These other amino acid sequences may be of borrelial 
origin (e.g., another sequence selected from Table 1) or non-borrelial origin. An example of such 
a fusion protein is reported in Fikrig, E. et aU Science 250:553-556 (1990) where an OspA- 
glutathione-S-transferase fusion protein was produced and shown to elicit protective immunity 

10 against Lyme disease in immune competent mice. 

The present invention further relates to variants of the nucleic acid molecules of the present 
invention, which encode portions, analogs or derivatives of the B. burgdorferi polypeptides 
shown in Table 1. Variants may occur naturally, such as a natural allelic variant By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 

15 chromosome of an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985). 
Non-naturaUy occurring variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions or additions. 
The substitutions, deletions or additions may involve one or more nucleotides. These variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions 

20 may produce conservative or non-conservative amino acid substitutions, deletions or additions. 
Especially preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of the B. burgdorferi polypeptides disclosed herein or portions 
thereof. Also especially preferred in this regard are conservative substitutions. 

The present application is further directed to nucleic acid molecules at least 90%, 95%, 

25 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1. The above 

nucleic acid sequences are included irrespective of whether they encode a polypeptide having B. 
burgdorferi activity. This is because even where a particular nucleic acid molecule does not 
encode a polypeptide having B. burgdorferi activity, one of skill in the art would still know how 
to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid 

30 molecules of the present invention that do not encode a polypeptide having B. burgdorferi activity 
include, inter alia, isolating an B. burgdorferi gene or allelic variants thereof from a DNA library, 
and detecting B. burgdorferi mRNA expression samples, environmental samples, suspected of 
containing B. burgdorferi by Northern Blot analysis. 

Embodiments of the invention include isolated nucleic acid molecules comprising a 

35 polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 
95%, 96%, 97%, 98% or 99% identical to (a) a nucleotide sequence encoding any of the amino 
acid sequences of the full-length polypeptides shown in Table 1; (b) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1 but 
minus the N-tenninal methionine residue, if present; (c) a nucleotide sequence encoding any of the 
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amino acid sequences of the truncated polypeptides shown in Table 1; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b), or (c) above. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 
98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a 
5 polypeptide having B. burgdorferi protein activity By M a polypeptide having B. burgdorferi 
activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an 
activity of the B. burgdorferi protein of the invention, as measured in a particular biological assay 
suitable for measuring activity of the specified protein. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 

10 recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 
96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table 1 will encode a 
polypeptide having B. burgdorferi protein activity. In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even 
without performing the above described comparison assay. It will be further recognized in the art 

15 that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having B. burgdorferi protein activity. This is because the skilled 
artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly effect protein function (e.g., replacing one aliphatic amino acid with a second 
aliphatic amino acid), as further described below. 

20 The biological activity or function of the polypeptides of the present invention are expected 

to be similar or identical to polypeptides from other bacteria that share a high degree of structural 
identity/similarity. Tables 2 lists accession numbers and descriptions for the closest matching 
sequences of polypeptides available through Genbank and Derwent databases. It is therefore 
expected that the biological activity or function of the polypeptides of the present invention will be 

25 similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in 
Table 2. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to 
a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence 
of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence 

30 may include up to five point mutations per each 100 nucleotides of the reference nucleotide 
sequence encoding the B. burgdorferi polypeptide. In other words, to obtain a polynucleotide 
having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% 
(5 of 100) of the nucleotides in the reference sequence may be deleted, inserted, or substituted 
with another nucleotide. The query sequence may be an entire sequence shown in Table 1, the 

35 ORF (open reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention 
can be determined conventionally using known computer programs. A preferred method for 
determining the best overall match between a query sequence (a sequence of the present invention) 
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and a subject sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et ah 
(1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can be compared by first converting U*s to T's. 
5 The result of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k- 
tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

10 If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, 

not because of internal deletions, a manual correction must be made to the results. This is because 
the FASTDB program does not account for 5' and 3* truncations of the subject sequence when 
calculating percent identity. For subject sequences truncated at the 5* or 3' ends, relative to the 
query sequence, the percent identity is corrected by calculating the number of bases of the query 

15 sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to arrive at a 
final percent identity score. This corrected score is what is used for the purposes of the present 

20 invention. Only nucleotides outside the 5* and 3' nucleotides of the subject sequence, as 

displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query 
sequence to determine percent identity. The deletions occur at the 5' end of the subject sequence 

25 and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 

nucleotides at 5* end. The 10 unpaired nucleotides represent 10% of the sequence (number of 
nucleotides at the 5' and 3* ends not matched/total number of nucleotides in the query sequence) 
so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 nucleotides were perfectly matched the final percent identity would be 90%. In 

30 another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query 
sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5' 
or 3' of the subject sequence which are not matched/aligned with the query. In this case the 
percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 
5' and 3' of the subject sequence which are not matched/aligned with the query sequence are 

35 manually corrected for. No other manual corrections are to made for the purposes of the present 
invention. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated DNA molecules of 
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the present invention, host cells which are genetically engineered with the recombinant vectors, 
and the production of B. burgdorferi polypeptides or fragments thereof by recombinant 
techniques. 

Recombinant constructs may be introduced into host cells using well known techniques 
5 such as infection, transduction, transfection, transvection, electroporation and transformation. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors 
may be replication competent or replication defective. In the latter case, viral propagation 
generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 

10 propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium 
phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be 
packaged in vitro using an appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising as-acting control regions to the polynucleotide of 
interest Appropriate trans-acting factors may be supplied by the host, supplied by a 

15 complementing vector or supplied by the vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for specific 
expression, which may be inducible and/or cell type-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. 

20 Expression vectors useful in the present invention include chromosomal-, episomal- and 

virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast 
episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors 
derived from combinations thereof, such as cosmids and phagemids. 

25 The DNA insert should be operatively linked to an appropriate promoter, such as the 

phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late 
promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be 
known to the skilled artisan. The expression constructs will further contain sites for transcription 
initiation, termination and, in the transcribed region, a ribosome binding site for translation. The 

30 coding portion of the mature transcripts expressed by the constructs will preferably include a 
translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable marker. 
Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture 

35 and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. 

Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such 
as E coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the 
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above-described host cells are known in the art 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available 
from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, 
5 pKK223-3, pKK233-3, pDR540, pRTTS available from Pharmacia. Among preferred eukaiyotic 
vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be 
readily apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention include the £. 

10 coli lad and lacL promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR and PL 
promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs, such as those of the Rous sarcoma virus (RS V), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. 

15 Introduction of the construct into the host cell can be effected by calcium phosphate 

transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et al, Basic Methods In Molecular Biology (1986). 
Transcription of DNA encoding the polypeptides of the present invention by higher 

20 eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are 
cw-acting elements of DNA, usually about from 10 to 300 bp that act to increase transcriptional 
activity of a promoter in a given host cell-type. Examples of enhancers include the SV40 
enhancer, which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 

25 origin, and adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, 
into the periplasmic space or into the extracellular environment, appropriate secretion signals may 
be incorporated into the expressed polypeptide. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

30 The polypeptide may be expressed in a modified form, such as a fusion protein, and may 

include not only secretion signals, but also additional heterologous functional regions. For 
instance, a region of additional amino acids, particularly charged amino acids, may be added to 
the N-terminus of the polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties may be added to 

35 the polypeptide to facilitate purification. Such regions may be removed prior to final preparation 
of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, are familiar and routine 
techniques in the art A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-0 464 533 (Canadian 
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counterpart 2045869) discloses fusion proteins comprising various portions of constant region of 
immunoglobin molecules together with another human protein or part thereof. In many cases, the 
Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 
5 hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein 
has been expressed, detected and purified in the advantageous manner described. This is the case 
when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the 
fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, hIL5-receptor has been fused with Fc portions for the purpose of 
10 high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al, J. 
Molec. Recogn. 8:52-58 (1995) and Johanson, K. etaU Biol Chem. 270 (7<5J:9459-9471 
(1995). 

The B. burgdorferi polypeptides can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 

15 extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") 
is employed for purification. Polypeptides of the present invention include naturally purified 
products, products of chemical synthetic procedures, and products produced by recombinant 

20 techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 

Polypeptides and Fragments 

The invention further provides isolated polypeptides having the amino acid sequences in 

25 Table 1, and peptides or polypeptides comprising portions of the above polypeptides. The terms 
"peptide" and "oligopeptide" are considered synonymous (as is commonly recognized) and each 
term can be used interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polypeptide" is used herein for chains containing 
more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences 

30 herein are written from left to right and in the direction from amino terminus to carboxy terminus. 

As discussed in detail below, immunization using B. burgdorferi sensu stricto isolate B3 1 
decorin-binding protein elicits the production of antiserum which confers passive immunity 
against Borrelia species and strains which express divergent forms of this protein. Cassatt, D. et 
aL, Protection of Borrelia burgdorferi Infection by Antibodies to Decorin-binding Protein, in 

35 Vaccines97, Cold Spring Harbor Press (1997), pages 191-195. Thus, some amino acid 
sequences of the 5. burgdorferi polypeptides shown in Table 1 can be varied without 
significantly effecting the antigenicity of the polypeptides. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the polypeptide which 
determine antigenicity. In general, it is possible to replace residues which do not form part of an 
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antigenic epitope without significantly effecting the antigenicity of a polypeptide. 
Variant and Mutant Polypeptides 

To improve or alter the characteristics of B. burgdorferi polypeptides of the present 
invention, protein engineering may be employed. Recombinant DNA technology known to those 
5 skilled in the art can be used to create novel mutant proteins or muteins including single or 
multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified 
polypeptides can show, e.g., enhanced activity or increased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. 

10 

N -Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. 
Chem., 268:2984-2988 (1993), reported modified KGF proteins that had heparin binding 

15 activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the 
present invention provides polypeptides having one or more residues deleted from the amino 
terminus of the amino acid sequence of the B. burgdorferi polypeptides shown in Table 1, and 
polynucleotides encoding such polypeptides. 

Similarly, many examples of biologically functional C-terminal deletion muteins are 

20 known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 
amino acid residues from the carboxy terminus of the protein See, e.g., Dobeli, et al. (1988) J. 
Biotechnology 7:199-216. Accordingly, the present invention provides polypeptides having one 
or more residues from the carboxy terminus of the amino acid sequence of the B. burgdorferi 
polypeptides shown in Table 1. The invention also provides polypeptides having one or more 

25 amino acids deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions or fragments 
of the amino acid sequences described herein as well as to portions or fragments of the isolated 
amino acid sequences described herein. Fragments include portions of the amino acid sequences 
of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one 

30 of which representing a N-terminal position. The initiation codon of the polypeptides of the 
present inventions position 1. Every combination of a N-terminal and C-terminal position that a 
fragment at least 5 contiguous amino acid residues in length could occupy, on any given amino 
acid sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of residues in a 

35 full length amino acid sequence minus 1. Therefore, included in the invention are contiguous 
fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth 
in Table 1 wherein the contiguous fragment is any integer between 5 and the number of residues 
in a full length sequence minus 1. 

Further, the invention includes polypeptides comprising fragments specified by size, in 
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amino acid residues, rather than by N-terminal and C-terminal positions. The invention includes 
any fragment size, in contiguous amino acid residues, selected from integers between 5 and the 
number of residues in a full length sequence minus 1. Preferred sizes of contiguous polypeptide 
fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino 
5 acid residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid 
residues, about 100 amino acid residues, about 200 amino acid residues, about 300 amino acid 
residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any integer between 5 
and the number of residues in a full length sequence minus 1 are included in the invention. The 

10 present invention also provides for the exclusion of any fragments specified by N-terminal and C- 
terminal positions or by size in amino acid residues as described above. Any number of 
fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as 
described above may be excluded. 

The above fragments need not be active since they would be useful, for example, in 

15 immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion 
of the protein, as vaccines, and as molecular weight markers. 

Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, it also will 

20 be recognized by one of ordinary skill in the art that some amino acid sequences of the B. 

burgdorferi polypeptide can be varied without significant effect of the structure or function of the 
protein. If such differences in sequence are contemplated, it should be remembered that there will 
be critical areas on the protein which determine activity. 

Thus, the invention further includes variations of the B. burgdorferi polypeptides which 

25 show substantial B. burgdorferi polypeptide activity or which include regions of B. burgdorferi 
protein such as the protein portions discussed below. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions selected according to general rules known in the art so 
as to have little effect on activity. For example, guidance concerning how to make phenotypically 
silent amino acid substitutions is provided. There are two main approaches for studying the 

30 tolerance of an amino acid sequence to change. See, Bowie, J. U. et al. (1990), Science 

247:1306-1310. The first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or screens to 
identify sequences that maintain functionality. 

35 These studies have revealed that proteins are surprisingly tolerant of amino acid 

substitutions. The studies indicate which amino acid changes are likely to be permissive at a 
certain position of the protein. For example, most buried amino acid residues require nonpolar 
side chains, whereas few features of surface side chains are generally conserved. Other such 
phenotypically silent substitutions are described by Bowie et al. (supra) and the references cited 
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therein. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and lie; interchange of the hydroxyl residues Ser 
and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues 
Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 
5 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 1, or that 
encoded by the plaimds listed in Table 1 , may be: (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be one 

10 encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes 
a substituent group: or (iii) one in which the B. burgdorferi polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol): or (iv) one in which the additional amino acids are fused to the above form 
of the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a 

15 sequence which is employed for purification of the above form of the polypeptide or a proprotein 
sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those 
skilled in the art from the teachings herein. 

Thus, the B. burgdorferi polypeptides of the present invention may include one or more 
amino acid substitutions, deletions, or additions, either from natural mutations or human 

20 manipulation. As indicated, changes are preferably of a minor nature, such as conservative amino 
acid substitutions that do not significandy affect the folding or activity of the protein (see Table 3). 

Amino acids in the B. burgdorferi proteins of the present invention that are essential for 
function can be identified by methods known in the art, such as site-directed mutagenesis or 
alanine-scanning mutagenesis. See, e.g., Cunningham et al. (1989) Science 244: 1081-1085. 

25 The latter procedure introduces single alanine mutations at every residue in the molecule. The 
resulting mutant molecules are then tested for biological activity using assays appropriate for 
measuring the function of the particular protein. 

Of special interest are substitutions of charged amino acids with other charged or neutral 
amino acids which may produce proteins with highly desirable improved characteristics, such as 

30 less aggregation. Aggregation may not only reduce activity but also be problematic when 
preparing pharmaceutical formulations, because aggregates can be immunogenic. See, e.g., 
Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838- 
845; Cleland, et al., (1993) Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an isolated form, and 

35 preferably are substantially purified A recombinantly produced version of the B. burgdorferi 
polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) 
Gene 67:3 1-40. Polypeptides of the invention also can be purified from natural or recombinant 
sources using antibodies directed against the polypeptides of the invention in methods which are 
well known in the art of protein purification. 
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The invention further provides for isolated fi. burgdorferi polypeptides comprising an 
amino acid sequence selected from the group consisting of: (a) the amino acid sequence of a full- 
length 5. burgdorferi polypeptide having the complete amino acid sequence shown in Table 1; 
(b) the amino acid sequence of a full-length B. burgdorferi polypeptide having the complete amino 
5 acid sequence shown in Table 1 excepting the N-terminal methionine; (c) the complete amino acid 
sequence encoded by the plaimds listed in Table 1 ; and (d) the complete amino acid sequence 
excepting the N-terminal methionine encoded by the plaimds listed in Table 1 . The polypeptides 
of the present invention also include polypeptides having an amino acid sequence at least 80% 
identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% 

10 or 99% identical to those described in (a), (b), (c), and (d) above. 

Further polypeptides of the present invention include polypeptides which have at least 
90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 
97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which comprises the amino 

15 acid sequence of a B. burgdorferi polypeptide having an amino acid sequence which contains at 
least one conservative amino acid substitution, but not more than 50 conservative amino acid 
substitutions, not more than 40 conservative amino acid substitutions, not more than 30 
conservative amino acid substitutions, and not more than 20 conservative amino acid 
substitutions. Also provided are polypeptides which comprise the amino acid sequence of a B. 

20 burgdorferi polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
conservative amino acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to 
a query amino acid sequence of the present invention, it is intended that the amino acid sequence 
of the subject polypeptide is identical to the query sequence except that the subject polypeptide 

25 sequence may include up to five amino acid alterations per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 

30 reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 96%, 97%, 
98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino 

35 acid sequence encoded by the plaimds listed in Table 1 can be determined conventionally using 
known computer programs. A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the 
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query and subject sequences are both amino acid sequences. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment 
are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 
5 Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is 
shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, the results, in percent identity, must be manually 
corrected. This is because the FASTDB program does not account for N- and C-terminal 

10 truncations of the subject sequence when calculating global percent identity. For subject 

sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C-terminal 
of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a 
percent of the total bases of the query sequence. Whether a residue is matched/aligned is 

15 determined by results of the FASTDB sequence alignment. This percentage is then subtracted 
from the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score is what is 
used for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are considered for the 

20 purposes of manually adjusting the percent identity score. That is, only query amino acid 
residues outside the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query 
sequence to determine percent identity. The deletion occurs at the N-terminus of the subject 
sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at 

25 the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at 
the N- and C- termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the remaining 
90 residues were perfecdy matched the final percent identity would be 90%. In another example, 
a 90 residue subject sequence is compared with a 100 residue query sequence. This time the 

30 deletions are internal so there are no residues at the N- or C-termini of the subject sequence which 
are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is 
not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of 
the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 
with the query sequence are manually corrected. No other manual corrections are to made for the 

35 purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they have their 
normal biological activity. This is because even where a particular polypeptide molecule does not 
have biological activity, one of skill in the art would still know how to use the polypeptide, for 
instance, as a vaccine or to generate antibodies. Other uses of the polypeptides of the present 
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invention that do not have B. burgdorferi activity include, inter alia, as epitope tags, in epitope 
mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel 
filtration columns using methods known to those of skill in the art 

As described below, the polypeptides of the present invention can also be used to raise 
5 polyclonal and monoclonal antibodies, which are useful in assays for detecting B, burgdorferi 
protein expression or as agonists and antagonists capable of enhancing or inhibiting B. 
burgdorferi protein function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" B. burgdorferi protein binding proteins which are also candidate agonists and 
antagonists according to the present invention. See, e.g., Fields et al. (1989) Nature 
10 340:245-246. 

Epitope-B earing Portions 

In another aspect, the invention provides peptides and polypeptides comprising 
epitope-bearing portions of the B. burgdorferi polypeptides of the present invention. These 

15 epitopes are immunogenic or antigenic epitopes of the polypeptides of the present invention. An 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the 
whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be 
confined to a few loci on the molecule. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The 

20 number of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 
Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists 
amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The 
polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not 

25 considered non-antigenic. This is because they may still be antigenic in vivo but merely not 
recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues 
comprising preferred antigenic epitopes but not a complete list Amino acid residues comprising 
other anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf analysis or 
by in vivo testing for an antigenic response using the methods described herein or those known in 

30 the art 

As to the selection of peptides or polypeptides bearing an antigenic epitope (Le. y that 
contain a region of a protein molecule to which an antibody can bind), it is well known in that art 
that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable 
of eliciting an antiserum that reacts with the partially mimicked protein. See, e.g., Sutcliffe, et 
35 al., (1983) Science 219:660-666. Peptides capable of eliciting protein-reactive sera are frequently 
represented in the primary sequence of a protein, can be characterized by a set of simple chemical 
rules, and are confined neither to immunodominant regions of intact proteins (Le., immunogenic 
epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective at inducing antibodies that bind to the 
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mimicked protein; longer, peptides, especially those containing proline residues, usually are 
effective. See, Sutcliffe, et al., supra, p. 661. For instance, 18 of 20 peptides designed 
according to these guidelines, containing 8-39 residues covering 75% of the sequence of the 
influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the 
5 HA1 protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the 
rabies glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful 
to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the 
invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors 

10 immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the 
native protein. See Sutcliffe, et aL, supra, p. 663. The antibodies raised by antigenic 
epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies 
to different peptides may be used for tracking the fate of various regions of a protein precursor 
which undergoes post-translational processing. The peptides and anti-peptide antibodies may be 

15 used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in 
competition assays since it has been shown that even short peptides (e.g., about 9 amino acids) 
can bind and displace the larger peptides in immunoprecipitation assays. See, e.g., Wilson, et 
al., (1984) Cell 37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography using methods 

20 known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention designed according 
to the above guidelines preferably contain a sequence of at least seven, more preferably at least 
nine and most preferably between about 10 to about 50 amino acids (i.e. any integer between 7 
and 50) contained within the amino acid sequence of a polypeptide of the invention. However, 

25 peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide 
of the invention, containing about 50 to about 100 amino acids, or any length up to and including 
the entire amino acid sequence of a polypeptide of the invention, also are considered 
epitope-bearing peptides or polypeptides of the invention and also are useful for inducing 
antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the 

30 epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents (i.e., the 
sequence includes relatively hydrophilic residues and highly hydrophobic sequences are 
preferably avoided); and sequences containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate 
an Borrelia-specific immune response or antibodies include portions of the amino acid sequences 

35 identified in Table 1. More specifically, Table 4 discloses a list of non-limiting residues that are 
involved in the antigenicity of the epitope-bearing fragments of the present invention. Therefore, 
the present inventions provides for isolatd and purified antigenic epitope-bearing firagements of 
the polypeptides of the present invention comprising a peptide sequences of Table 4. The 
antigenic epitope-bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
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sequence of at least seven, more preferably at least nine and most preferably between about 10 to 
about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide of the present 
invention. That is, included in the present invention are antigenic polypeptides between the 
integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4. 
5 Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic 
polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length 
comprising one or more of the sequences of Table 4 are included. The antigenic epitope-bearing 
firagements may be specified by either the number of contiguous amino acid residues or by 
specific N-terminal and C-terminal positions as described above for the polypeptide firagements of 

10 the present invention, wherein the initiation codon is residue 1. Any number of the described 
antigenic epitope-bearing firagements of the present invention may also be excluded from the 
present invention in the same manner. 

The epitope-bearing peptides and polypeptides of the invention may be produced by any 
conventional means for making peptides or polypeptides including recombinant means using 

15 nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of 
the present invention may be fused to a larger polypeptide which acts as a carrier during 
recombinant production and purification, as well as during immunization to produce anti-peptide 
antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical 
synthesis. For instance, Houghten has described a simple method for synthesis of large numbers 

20 of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid 
variants of a segment of the HA1 polypeptide which were prepared and characterized (by 
EUSA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. 
USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process 
is further described in U.S. Patent No. 4,631,21 1 to Houghten and coworkers (1986). In this 

25 procedure the individual resins for the solid-phase synthesis of various peptides are contained in 
separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps 
involved in solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. Sci. 
82:5131-5135 at 5134. 

30 Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies 

according to methods well known in the art. See, e.g., Sutcliffe, et al., supra;; Wilson, et al., 
supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. Generally, animals may be 
immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of 
the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus 

35 toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such 
as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as 
rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 ^ig peptide or 
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carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at 
intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by selection of 
5 anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of 
the selected antibodies according to methods well known in the art 

Immunogenic epitope-bearing peptides of the invention, Le. 9 those parts of a protein that 
elicit an antibody response when the whole protein is the immunogen, are identified according to 
methods known in the art. For instance, Geysen, et al , supra, discloses a procedure for rapid 

10 concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an 
ELISA. Interaction of synthesized peptides with antibodies is then easily detected without 
removing them from the support. In this manner a peptide bearing an immunogenic epitope of a 
desired protein may be identified routinely by one of ordinary skill in the art. For instance, the 
immunologically important epitope in the coat protein of foot-and-mouth disease virus was located 

15 by Geysen et al supra with a resolution of seven amino acids by synthesis of an overlapping set 
of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. 
Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn 
at every position within the epitope were synthesized, and the particular amino acids conferring 
specificity for the reaction with antibody were determined. Thus, peptide analogs of the 

20 epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an 
immunogenic epitope of a desired protein. 

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method 
of detecting or determining the sequence of monomers (amino acids or other compounds) which 

25 is a topological equivalent of the epitope (Le. , a "mimotope") which is complementary to a 

particular paratope (antigen binding site) of an antibody of interest More generally, U.S. Patent 
No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a 
sequence of monomers which is a topographical equivalent of a ligand which is complementary to 
the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 

30 to Houghten, R. A. et al (1996) discloses linear Cj-Cj-alkyl peralkylated oligopeptides and sets 
and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries 
for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor 
molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention 
also can be made routinely by these methods. The entire disclosure of each document cited in this 

35 section on "Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present invention and the 
epitope-bearing fragments thereof described above can be combined with parts of the constant 
domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. This has been shown, e.g., for 
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chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various 
domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. 
(EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a 
disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and 
5 neutralizing other molecules than a monomelic B. burgdorferi polypeptide or fragment thereof 
alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. Nucleic acids encoding the 
above epitopes of B. burgdorferi polypeptides can also be recombined with a gene of interest as 
an epitope tag to aid in detection and purification of the expressed polypeptide. 

10 Antibodies 

B. burgdorferi protein-specific antibodies for use in the present invention can be raised 
against the intact B. burgdorferi protein or an antigenic polypeptide fragment thereof, which may 
be presented together with a carrier protein, such as an albumin, to an animal system (such as 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. 

15 As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 

include intact molecules, single chain whole antibodies, and antibody fragments. Antibody 
fragments of the present invention include Fab and F(ab')2 and other fragments including single- 
chain Fvs (scFv) and disulfide-linked Fvs (sdFv). Also included in the present invention are 
chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the 

20 polypeptides of the present invention. The antibodies of the present invention may be prepared by 
any of a variety of methods. For example, cells expressing a polypeptide of the present invention 
or an antigenic fragment thereof can be administered to an animal in order to induce the production 
of sera containing polyclonal antibodies. For example, a preparation of B. burgdorferi 
polypeptide or fragment thereof is prepared and purified to render it substantially free of natural 

25 contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

In a preferred method, the antibodies of the present invention are monoclonal antibodies or 
binding fragments thereof. Such monoclonal antibodies can be prepared using hybridoma 
technology. See, e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold 

30 Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL 
ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and 
F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, B. burgdorferi 
polypeptide-binding fragments, chimeric, and humanized antibodies can be produced through the 

35 application of recombinant DNA technology or through synthetic chemistry using methods known 
in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide antigen of the 
present invention may be produced in a two-step procedure through the use of anti-idiotypic 
antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, 
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therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance 
with this method, B. burgdorferi polypeptide-specific antibodies are used to immunize an animal, 
preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma ceUs, 
and the hybridoma cells are screened to identify clones which produce an antibody whose ability 
5 to bind to the B. burgdorferi polypeptide-specific antibody can be blocked by the B. burgdorferi 
polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the B. burgdorferi 
polypeptide-specific antibody and can be used to immunize an animal to induce formation of 
further B. burgdorferi polypeptide-specific antibodies. 

Antibodies and firagements thereof of the present invention may be described by the 

10 portion of a polypeptide of the present invention recognized or specifically bound by the antibody. 
Antibody binding firagements of a polypeptide of the present invention may be described or 
specified in the same manner as for polypeptide firagements discussed above., i.e, by N-terminal 
and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody 
binding fragments, of a polypeptide of the present invention, specified by N-terminal and C- 

15 terminal positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the specifically 
bind a particuarlly discribed firagement of a polypeptide of the present invention and allows for the 
exclusion of the same. 

Antibodies and firagements thereof of the present invention may also be described or 

20 specified in terms of their cross-reactivity. Antibodies and fragements that do not bind 

polypeptides of any other species of Borrelia other than B. burgdorferi are included in the present 
invention. Likewise, antibodies and fragements that bind only species of Borrelia, i.e. antibodies 
and fragements that do not bind bacteria from any genus other than Borrelia, are included in the 
present invention. 

25 

Diagnostic Assays 

The present invention further relates to methods for assaying staphylococcal infection in 
an animal by detecting the expression of genes encoding staphylococcal polypeptides of the 
present invention. The methods comprise analyzing tissue or body fluid from the animal for 

30 Ztorrrfia-specific antibodies, nucleic acids, or proteins. Analysis of nucleic acid specific to 
Borrelia is assayed by PCR or hybridization techniques using nucleic acid sequences of the 
present invention as either hybridization probes or primers. See, e.g., Sambrook et al. Molecular 
cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing differentiation 

35 among spotted fever group Rickettsiae species by analysis of restriction fragment length 

polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol. 32:589-595 
(detecting B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Borrelia has already been 
made, the present invention is useful for monitoring progression or regression of the disease state 
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whereby patients exhibiting enhanced Borrelia gene expression will experience a worse clinical 
outcome relative to patients expressing these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an animal, cell 
line, tissue culture, or other source which contains Borrelia polypeptide, mRNA, or DNA. 

5 Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial 
fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other biological source suspected 
of containing Borrelia polypeptides or nucleic acids. Methods for obtaining biological samples 
such as tissue are well known in the art. 

The present invention is useful for detecting diseases related to Borrelia infections in 

10 animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, 
rabbits and humans. Particularly preferred are humans. 

Total RNA can be isolated from a biological sample using any suitable technique such as 
the single-step giianidinium-thiocyanate-phenol-chloroform method described in Chomczynski et 
al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Borrelia polypeptides having sufficient 

15 homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between 
complementary sequences are then assayed using any appropriate method. These include 
Northern blot analysis, SI nuclease mapping, the polymerase chain reaction (PCR), reverse 
transcription in combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

20 Northern blot analysis can be performed as described in Harada et al. (1990) Cell 

63:303-312. Briefly, total RNA is prepared from a biological sample as described above. For the 
Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl 
sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred 
onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter 

25 is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon 
sperm, SDS, and sodium phosphate buffer. A B. burgdorferi polynucleotide sequence shown in 
Table 1 labeled according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and 
exposed to x-ray film. DNA for use as probe according to the present invention is described in 

30 the sections above and will preferably at least 15 nucleotides in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To 
prepare probe DNA for use in S 1 mapping, the sense strand of an above-described B. burgdorferi 
DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. 
The antisense DNA can then be digested using an appropriate restriction endonuclease to generate 

35 further DNA probes of a desired length. Such antisense probes are useful for visualizing 

protected bands corresponding to the target mRNA (Le. 9 mRNA encoding Borrelia polypeptides). 

Levels of mRNA encoding Borrelia polypeptides are assayed, for e.g., using the RT-PCR 
method described in Makino et al. (1990) Technique 2:295-301. By this method, the 
radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial 
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concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from 
a biological sample in a reaction mixture containing a RT primer and appropriate buffer. After 
incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, 
DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
5 of the RNA, the RT products are then subject to PCR using labeled primers. Alternatively, rather 
than labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR 
amplification can be performed in a DNA thermal cycler according to conventional techniques. 
After a suitable number of rounds to achieve amplification, the PCR reaction mixture is 
electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate 

10 bands (corresponding to the mRNA encoding the Borrelia polypeptides of the present invention) 
are quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, 
reagent and gel concentrations, and labeling methods are well known in the art Variations on the 
RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can detect the 
nucleic acid of the present invention can be found in PCR PRIMER: A LABORATORY 

15 MANUAL (C.W. Dieffenbach et al. eds., Cold Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, may be used 
to detect polynucleotides of the present invention or Borrelia species including B. burgdorferi 
using bio chip technology. The present invention includes both high density chip arrays (>1000 
oligonucleotides per cm 2 ) and low density chip arrays (<1000 oligonucleotides per cm 2 ). Bio 

20 chips comprising arrays of polynucleotides of the present invention may be used to detect Borrelia 
species, including B. burgdorferi, in biological and environmental samples and to diagnose an 
animal, including humans, with an B. burgdorferi or other Borrelia infection. The bio chips of 
the present invention may comprise polynucleotide sequences of other pathogens including 
bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide 

25 sequences of the present invention, for use in rapid dif fenertial pathogenic detection and 
diagnosis. The bio chips can also be used to monitor an B. burgdorferi or other Borrelia 
infections and to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response 
to drug therapy in the clinic and drug development in the laboratory. The bio chip technology 
comprising arrays of polynucleotides of the present invention may also be used to simultaneously 

30 monitor the expression of a multiplicity of genes, including those of the present invention. The 
polynucleotides used to comprise a selected array may be specified in the same manner as for the 
fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. 
Methods and particular uses of the polynucleotides of the present invention to detect Borrelia 
species, including B. burgdorferi y using bio chip technology include those known in the art and 

35 those of: U.S. Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 
5527681, 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/951 1995, WO/9743447, WO/9535505, each incorporated herein in their entireties. 

Biosensors using the polynucleotides of the present invention may also be used to detect, 
diagnose, and monitor B. burgdorferi or other Borrelia species and infections thereof. 
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Biosensors using the polynucleotides of the present invention may also be used to detect particular 
polynucleotides of the present invention. Biosensors using the polynucleotides of the present 
invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, 
etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods 
5 and particular uses of the polynucleotides of the present invention to detect Borrelia species, 
including B. burgdorferi, using biosenors include those known in the art and those of: U.S. 
Patent Nos 5721 102, 5658732, 5631 170, and World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

Thus, the present invention includes both bio chips and biosensors comprising 

10 polynucleotides of the present invention and methods of their use. 

Assaying Borrelia polypeptide levels in a biological sample can occur using any art-known 
method, such as antibody-based techniques. For example, Borrelia polypeptide expression in 
tissues can be studied with classical immunohistological methods. In these, the specific 
recognition is provided by the primary antibody (polyclonal or monoclonal) but the secondary 

15 detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a 
result, an immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Borrelia 
polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. Cell. 
Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 105:3087-3096. In this technique, 

20 which is based on the use of cationic solid phases, quantitation of a Borrelia polypeptide can be 
accomplished using an isolated Borrelia polypeptide as a standard. This technique can also be 
applied to body fluids. 

Other antibody-based methods useful for detecting Borrelia polypeptide gene expression 
include immunoassays, such as the ELIS A and the radioimmunoassay (RIA). For example, a 

25 Borrelia polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and 
as an enzyme-labeled probe to detect and quantify a Borrelia polypeptide. The amount of a 
Borrelia polypeptide present in the sample can be calculated by reference to the amount present in 
a standard preparation using a linear regression computer algorithm. Such an ELIS A is described 
in Iacobelli et al. (1988) Breast Cancer Research and Treatment 1 1: 19-30. In another ELISA 

30 assay, two distinct specific monoclonal antibodies can be used to detect Borrelia polypeptides in a 
body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as 
the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or "two-step" assay. 
The "one-step" assay involves contacting the Borrelia polypeptide with immobilized antibody and, 

35 without washing, contacting the mixture with the labeled antibody. The "two-step" assay 
involves washing before contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to immobilize one component 
of the assay system on a support, thereby allowing other components of the system to be brought 
into contact with the component and readily removed from the sample. Variations of the above 
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and other immunological methods included in the present invention can also be found in Harlow 
et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nded. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, which 
5 catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is 
particularly preferred as it has good stability and its substrate (glucose) is readily available. 
Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide 
formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels 
include radioisotopes, such as iodine ( ,25 I, ,21 I), carbon ( l4 C), sulphur ( 35 S), tritium ( 3 H), indium 
10 ( ,l2 In), and technetium ("Tc), and fluorescent labels, such as fluorescein and rhodamine, and 
biottn. 

Further suitable labels for the Borrelia polypeptide-specific antibodies of the present 
invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, 
Borrelia nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol 

15 phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, m In, ,25 I, l31 1, 32 P, 35 S, 14 C, 5I Cr, 
57 To, 58 Co, 59 Fe, 75 Se, I52 Eu, *°Y, 67 Cu, 2,7 Ci, 2n At, 2,2 Pb, 47 Sc, 109 Pd, etc. M, In is a preferred 

20 isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the l25 I or 
l3l I-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more 
favorable gamma emission energy for imaging. See, e.g., Perkins et al. (1985) Eur. J. Nucl. 
Med. 10:296-301 ; Carasquillo et al. (1987) J. Nucl. Med. 28:281-287. For example, m In 
coupled to monoclonal antibodies with l-(P-isothiocyanatobenzyl>DPTA has shown little uptake 

25 in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include l57 Gd, 55 Mn, 162 Dy, 5> Tr, and 

56 Fe. 

Examples of suitable fluorescent labels include an ,52 Eu label, a fluorescein label, an 
30 isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and 
cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal label, an 
35 aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, 
a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei 
such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are provided by 
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Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 
81 : 1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate 
method, the dimaleimide method, the m-maleixnidobenzyl-N-hydroxy-succinimide ester method, 
all of which methods are incorporated by reference herein. 
5 In a related aspect, the invention includes a diagnostic kit for use in screening serum 

containing antibodies specific against B. burgdorferi infection. Such a kit may include an 
isolated B. burgdorferi antigen comprising an epitope which is specifically immunoreactive with 
at least one anti-B. burgdorferi antibody. Such a kit also includes means for detecting the 
binding of said antibody to the antigen. In specific embodiments, the kit may include a 

10 recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or 
polypeptide antigen may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit includes a 
solid support to which said peptide or polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the 

15 antibody to the B. burgdorferi antigen can be detected by binding of the reporter labeled antibody 
to the anti-R burgdorferi polypeptide antibody. 

In a related aspect, the invention includes a method of detecting B. burgdorferi infection in 
a subject. This detection method includes reacting a body fluid, preferably serum, from the 
subject with an isolated B. burgdorferi antigen, and examining the antigen for the presence of 

20 bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to 
a solid support, and serum is reacted with the support. Subsequently, the support is reacted with 
a reporter-labeled anti-human antibody. The support is then examined for the presence of 
reporter-labeled antibody. 

The solid surface reagent employed in the above assays and kits is prepared by known 

25 techniques for attaching protein material to solid support material, such as polymeric beads, dip 
sticks, 96-well plates or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein , typically through a 
free amine group, to a chemically reactive group on the solid support, such as an activated 
carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 

30 conjunction with biotinylated antigen(s). 

The polypeptides and antibodies of the present invention, including fragments thereof, 
may be used to detect Borrelia species including 5. burgdorferi using bio chip and biosensor 
technology. Bio chip and biosensors of the present invention may comprise the polypeptides of 
the present invention to detect antibodies, which specifically recognize Borrelia species, including 

35 B. burgdorferi. Bio chip and biosensors of the present invention may also comprise antibodies 
which specifically recognize the polypeptides of the present invention to detect Borrelia species, 
including 5. burgdorferi or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be used to detect 
Borrelia species, including B. burgdorferi, in biological and environmental samples and to 
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diagnose an animal, including humans, with an B. burgdorferi or other Borrelia infection. Thus, 
the present invention includes both bio chips and biosensors comprising polypeptides or 
antibodies of the present invention and methods of their use. 

The bio chips of the present invention may further comprise polypeptide sequences of 
5 other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition 
to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic 
detection and diagnosis. The bio chips of the present invention may further comprise antibodies 
or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal 
polypeptide sequences, in addition to the antibodies or fragements thereof of the present 

10 invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an B. burgdorferi or other 
Borrelia infection and to monitor the genetic changes (amio acid deletions, insertions, 
substitutions, etc.) in response to drug therapy in the clinic and drug development in the 
laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present 

15 invention may also be used to simultaneously monitor the expression of a multiplicity of 

polypeptides, including those of the present invention. The polypeptides used to comprise a bio 
chip or biosensor of the present invention may be specified in the same manner as for the 
fragements, i.e, by their N-terminal and C-terminal positions or length in contigious amino acid 
residue. Methods and particular uses of the polypeptides and antibodies of the present invention 

20 to detect Borrelia species, including B. burgdorferi, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. and World 
Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present 
invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and 
World Patent Nos. W09729366, W09612957, each incorporated herein in their entireties. 

25 

Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify those which 
enhance or block the biological activity of the B. burgdorferi polypeptides of the present 
30 invention. The present invention further provides where the compounds kill or slow the growth 
of B. burgdorferi. The ability of B. burgdorferi antagonists, including B. burgdorferi ligands, to 
prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled 
artisan. See, e.g., Straden et al. (1997) JBacteriol. 179(1):9-16. 

An agonist is a compound which increases the natural biological function or which 
35 functions in a manner similar to the polypeptides of the present invention, while antagonists 
decrease or eliminate such functions. Potential antagonists include small organic molecules, 
peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby 
inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross bridge 
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formation. Antibodies against B. burgdorferi may be employed to bind to and inhibit B. 
burgdorferi activity to treat antibiotic resistance. Any of the above antagonists may be employed 
in a composition with a pharmaceutical^ acceptable carrier. 

5 Vaccines 

The present invention also provides vaccines comprising one or more polypeptides of the 
present invention. Heterogeneity in the composition of a vaccine may be provided by combining 
B. burgdorferi polypeptides of the present invention. Multi-component vaccines of this type are 
desirable because they are likely to be more effective in eliciting protective immune responses 

10 against multiple species and strains of the Borrelia genus than single polypeptide vaccines. Thus, 
as discussed in detail below, a multi-component vaccine of the present invention may contain one 
or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 
8, of the B. burgdorferi polypeptides shown in Table 1, or fragments thereof. 

Multi-component vaccines are known in the art to elicit antibody production to numerous 

15 immunogenic components. Decker, M. and Edwards, K., /. Infect Dis. 774:S270-275 (1996). 
In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been 
demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic 
agents. Aristegui, J. et al, Vaccine 75:7-9 (1997). 

The present invention thus also includes multi-component vaccines. These vaccines 

20 comprise more than one polypeptide, immunogen or antigen. An example of such a multi- 
component vaccine would be a vaccine comprising more than one of the B. burgdorferi 
polypeptides shown in Table 1 . A second example is a vaccine comprising one or more, for 
example 2 to 10, of the B. burgdorferi polypeptides shown in Table 1 and one or more, for 
example 2 to 10, additional polypeptides of either borrelial or non-borrelial origin. Thus, a multi- 

25 component vaccine which confers protective immunity to both a borrelial infection and infection 
by another pathogenic agent is also within the scope of the invention. 

As indicated above, the vaccines of the present invention are expected to elicit a protective 
immune response against infections caused by species and strains of Borrelia other than B. 
burgdorferi sensu stricto isolate B31 (ATCC Accession No. 35210). Immunizations using 

30 decorin-binding protein and OspA derived from one strain of B. burgdorferi has been shown to 
elicit the production of antiserum which confers passive immunity against other strains of B. 
burgdorferi. Cassatt, D. et aL, Protection of Borrelia burgdorferi Infection by Antibodies to 
Decorin-binding Protein, in Vacones97, Cold Spring Harbor Press (1997), pages 191-195. 
Further, the inventors have found using an in vitro assay that antiserum produced in response to 

35 B. burgdorferi decorin-binding protein will kill several species of Borrelia. The amino acid 
sequences of decorin-binding protein expressed by different strains of B. burgdorferi are 
believed to diverge by as much as 25%. Thus, antisera elicited against decorin-binding proteins 
confers passive immunity against Borrelia expressing proteins having only 75% or less amino 
acid sequence similarity. 
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Further within the scope of the invention are whole cell and whole viral vaccines. Such 
vaccines may be produced recombinantly and involve the expression of one or more of the 
B. burgdorferi polypeptides shown in Table 1. For example, the 5. burgdorferi polypeptides of 
the present invention may be either secreted or localized intracellular, on the cell surface, or in the 
5 periplasmic space. Further, when a recombinant virus is used, the B. burgdorferi polypeptides 
of the present invention may, for example, be localized in the viral envelope, on the surface of the 
capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing 
heterologous proteins are known in the art. See, e.g., Robinson, K. et aL, Nature Biotech. 
75:653-657 (1997); Sirard, J. et aL, Infect. Immun. 65:2029-2033 (1997); Chabalgoity, J. et aL, 

10 Infect. Immun. 65:2402-2412 (1997). These cells may be administered live or may be killed 
prior to administration. Chabalgoity, J. et aL, supra, for example, report the successful use in 
mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth 
fatty acid-binding protein as a fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in the art by 

15 combining one or more B. burgdorferi polypeptides of the present invention, or fragments 

thereof, with additional non-borrelial components (e.g., diphtheria toxin or tetanus toxin, and/or 
other compounds known to elicit an immune response). Such vaccines are useful for eliciting 
protective immune responses to both members of the Borrelia genus and non-borrelial pathogenic 
agents. 

20 The vaccines of the present invention also include DNA vaccines. DNA vaccines are 

currently being developed for a number of infectious diseases. Boyer, J et aL, Nat. Med. 5:526- 
532 (1997); reviewed in Spier, R., Vaccine 14: 1285-1288 (1996). Such DNA vaccines contain a 
nucleotide sequence encoding one or more B. burgdorferi polypeptides of the present invention 
oriented in a manner that allows for expression of the subject polypeptide. The direct 

25 administration of plasmid DNA encoding OspA has been shown to elicit protective immunity in 
mice against borrelial challenge. Luke, C. et aL, J. Infect Dis. 775:91-97 (1997). 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim, J. et aL, Nature 
Biotech. 75:641-646 (1997), for example, report the enhancement of immune responses produced 

30 by DNA immunizations when DNA sequences encoding molecules which stimulate the immune 
response are co-administered. In a similar fashion, the vaccines of the present invention may be 
co-administered with either nucleic acids encoding immune modulators or the immune modulators 
themselves. These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

35 The vaccines of the present invention may be used to confer resistance to borrelial 

infection by either passive or active immunization. When the vaccines of the present invention are 
used to confer resistance to borrelial infection through active immunization, a vaccine of the 
present invention is administered to an animal to elicit a protective immune response which either 
prevents or attenuates a borrelial infection. When the vaccines of the present invention are used to 



WO 98/59071 



PCT/US98/12718 



39 

confer resistance to borrelial infection through passive immunization, the vaccine is provided to a 
host animal (e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered 
and direcdy provided to a recipient suspected of having an infection caused by a member of the 
Borretia genus. 

5 The ability to label antibodies, or fragments of antibodies, with toxin molecules provides 

an additional method for treating borrelial infections when passive immunization is conducted. In 
this embodiment, antibodies, or fragments of antibodies, capable of recognizing the 
B. burgdorferi polypeptides disclosed herein, or fragments thereof, as well as other Borrelia 
proteins, are labeled with toxin molecules prior to their administration to the patient. When such 

10 toxin derivatized antibodies bind to Borrelia cells, toxin moieties will be localized to these cells 
and will cause their death. 

The present invention thus concerns and provides a means for preventing or attenuating a 
borrelial infection resulting from organisms which have antigens that arc recognized and bound by 
antisera produced in response to the polypeptides of the present invention. As used herein, a 

15 vaccine is said to prevent or attenuate a disease if its administration to an animal results either in 
the total or partial attenuation (ie., suppression) of a symptom or condition of the disease, or in 
the total or partial immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are 

20 provided in advance of any symptoms of borrelial infection. The prophylactic administration of 
the compound(s) serves to prevent or attenuate any subsequent infection. When provided 
therapeutically, the compound(s) is provided upon or after the detection of symptoms which 
indicate that an animal may be infected with a member of the Borrelia genus. The therapeutic 
administration of the compound(s) serves to attenuate any actual infection. Thus, the 

25 B. burgdorferi polypeptides, and fragments thereof, of the present invention may be provided 
either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after 
the initiation of an actual infection. 

The polypeptides of the invention, whether encoding a portion of a native protein or a 
functional derivative thereof, may be administered in pure form or may be coupled to a 

30 macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable 
proteins which may act as macromolecular carrier for enhancing the immunogenicity of the 
polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, 
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of 
the present invention to such macromolecular carriers are disclosed in Harlow et aL, Antibodies: 

35 A Laboratory Manual, 2nd Ed. ; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1988), the entire disclosure of which is incorporated by reference herein. 

A composition is said to be "pharmacologically acceptable" if its administration can be 
tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an 
agent is said to be administered in a "therapeutically effective amount" if the amount administered 
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is physiologically significant An agent is physiologically significant if its presence results in a 
detectable change in the physiology of a recipient patient 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that the 
5 composition of a pharmacologically acceptable compound varies with the animal to which it is 
administered. For example, a vaccine intended for human use will generally not be co- 
administered with Freund's adjuvant. Further, the level of purity of the B. burgdorferi 
polypeptides of the present invention will normally be higher when administered to a human than 
when administered to a non-human animal. 

10 As would be understood by one of ordinary skill in the art, when the vaccine of the 

present invention is provided to an animal, it may be in a composition which may contain salts, 
buffers, adjuvants, or other substances which are desirable for improving the efficacy of the 
composition. Adjuvants are substances that can be used to specifically augment a specific 
immune response. These substances generally perform two functions: (1) they protect the 

15 antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically 
stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to the immune 
system, or presented separately, but into the same site of the animal being immunized. Adjuvants 
can be loosely divided into several groups based upon their composition. These groups include 

20 oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, 

AlKtSOJj, AlNaCSO^, AINH^SOJ, silica, kaolin, and carbon), polynucleotides (for example, 
poly IC and poly AU acids), and certain natural substances (for example, wax D from 
Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvwn, or 
BordeteUa pertussis, and members of the genus Brucella. Other substances useful as adjuvants 

25 are the saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for 
use in the present invention include aluminum salts, such as AlKfSO^, AlNa(S0 4 ) 2 , and 
AINH^SQj). Examples of materials suitable for use in vaccine compositions are provided in 
Remingtoris Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, PA, pp. 1324- 
1341 (1980), which reference is incorporated herein by reference). 

30 The therapeutic compositions of the present invention can be administered parenterally by 

injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, 
or orally. The compositions may alternatively be administered intramuscularly, or intravenously. 
Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 

35 polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance 
antigen absorption. Liquid dosage forms for oral administration may generally comprise a 
liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes 
include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents 
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commonly used in the art, such as purified water. Besides the inert diluents, such compositions 

can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 

flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
5 encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis 

infection using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide- 

co-glycolide) has been shown to stimulate protective immune responses. Shahin, R. et at, 

Infect Immun. 63:1 195-1200 (1995). Similarly, orally administered encapsulated Salmonella 

typhimurium antigens have also been shown to elicit protective immunity in mice. Allaoui- 
10 Attarki, K. et aL 9 Infect Immun. 65:853-857 (1997). Encapsulated vaccines of the present 

invention can be administered by a variety of routes including those involving contacting the 

vaccine with mucous membranes (e.g., intranasally, intracolonicly, intraduodenally). 

Many different techniques exist for the timing of the immunizations when a multiple 

administration regimen is utilized It is possible to use the compositions of the invention more 
15 than once to increase the levels and diversities of expression of the immunoglobulin repertoire 

expressed by the immunized animal. Typically, if multiple immunizations are given, they will be 

given one to two months apart. 

According to the present invention, an "effective amount" of a therapeutic composition is 

one which is sufficient to achieve a desired biological effect Generally, the dosage needed to 
20 provide an effective amount of the composition will vary depending upon such factors as the 

animal's or human's age, condition, sex, and extent of disease, if any, and other variables which 

can be adjusted by one of ordinary skill in the art. 

The antigenic preparations of the invention can be administered by either single or multiple 

dosages of an effective amount Effective amounts of the compositions of the invention can vary 
25 from 0 01-1,000 |ig/ml per dose, more preferably 0.1-500 fig/ml per dose, and most preferably 

10-300 jig/ml per dose. 

Having now generally described the invention, the same will be more readily understood 
through reference to the following example which is provided by way of illustration, and is not 
30 intended to be limiting of the present invention, unless specified. 

Examples 

L Preparation of PCR Primers and Amplification of DNA 
35 Various fragments of the Borrelia burgdorferi genome, such as those of Table 1, can be 

used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The 
PCR primers are preferably at least 15 bases, and more preferably at least 18 bases in length. 
When selecting a primer sequence, it is preferred that the primer pairs have approximately the 
same G/C ratio, so that melting temperatures are approximately the same. The PCR primers and 
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amplified DN A of this Example find use in the Examples that follow. 

2. Isolation of a Selected DNA Clone From B* burgdorferi 
Three approaches are used to isolate a 5. burgdorferi clone comprising a polynucleotide of 
5 the present invention from any B. burgdorferi genomic DNA library. The B. burgdorferi strain 
B31PU has been deposited as a convienent source for obtaining a B. burgdorferi strain although a 
wide varity of strains B. burgdorferi strains can be used which are known in the art. 

B. burgdorferi genomic DNA is prepared using the following method A 20ml overnight 
bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain Heart Infusion broth 

10 or Super broth), pelleted, ished two times with TES (30mM Tris-pH 8.0, 25mM EDTA, 50mM 
NaCl), and resuspended in 5ml high salt TES (2.5M NaCl). Lysostaphin is added to final 
concentration of approx 50ug/ml and the mixture is rotated slowly 1 hour at 37C to make 
protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and 
warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is 

15 then added to lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g 
in 5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). A CsCl 
gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and overlaying with 
2.85M CsCl. The gradient is carefully overlayed with the DNA-containing GuHCl solution. The 
gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume 

20 is increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 ug/ml) 
overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a 
desired buffer. 

In the first method, a plasmid is direcdy isolated by screening a plasmid B. burgdorferi 
genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the 
25 present invention. Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. The 

oligonucleotide is labeled, for instance, with ^-y-ATP using T4 polynucleotide kinase and 

purified according to routine methods. (See, e.g., Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is 

30 transformed into a suitable host, as indicated above (such as XL-l Blue (Stratagene)) using 
techniques known to those of skill in the art. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N. Y. 2nd ed. 1989); Ausubel et 
al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N. Y. 
1989). The transformants are plated on 1.5% agar plates (containing the appropriate selection 

35 agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. These plates 
are screened using Nylon membranes according to routine methods for bacterial colony screening. 
See, e.g., Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN 
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MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to 
those of skill in the ait. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a 
polynucleotide of Table 1 are synthesized and used to amplify the desired DNA by PCR using a 
5 B. burgdorferi genomic DNA prep as a template. PCR is carried out under routine conditions, for 
instance, in 25 |il of reaction mixture with 0.5 ug of the above DNA template. A convenient 
reaction mixture is 1.5-5 mM MgCl^, 0.01% (w/v) gelatin, 20 pM each of dATP, dCTP, dGTP, 
dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 

(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are 

10 performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is 
analyzed by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by subcloning and 
sequencing the DNA product. 

Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

15 synthesized and used to generate a nucleotide sequence of desired length using PCR methods 
known in the art 

3(a). Expression and Purification Borrelia polypeptides in E. coli 
The bacterial expression vector pQE60 is used for bacterial expression of some of the 

20 polypeptide fragements of the present invention. (QIAGEN, Inc., 9259 Eton Avenue, 

Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains 
a bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding site 
("RBS"), six codons encoding histidine residues that allow affinity purification using nickel- 
nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and suitable single 

25 restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA 

fragment encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 X 
His tag") covalently linked to the carboxyl terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a B. burgdorferi protein of the present 
invention is amplified from B. burgdorferi genomic DNA using PCR oligonucleotide primers 

30 which anneal to the 5* and 3' sequences coding for the portions of the B. burgdorferi 

polynucleotide shown in Table 1. Additional nucleotides containing restriction sites to facilitate 
cloning in the pQE60 vector are added to the 5' and 3* sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an appropriate 
restriction site followed by nucleotides of the amino terminal coding sequence of the desired B. 

35 burgdorferi polynucleotide sequence in Table 1. One of ordinary skill in the art would appreciate 
that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to 
amplify a DNA segment encoding any desired portion of the complete protein shorter or longer 
than the mature form. The 3' primer has a sequence containing an appropriate restriction site 
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followed by nucleotides complementary to the 3* end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified B. burgdorferi DNA fragment and the vector pQE60 are digested with 

5 restriction enzymes which recognize the sites in the primers and the digested DN As are then 

ligated together. The B. burgdorferi DNA is inserted into the restricted pQE60 vector in a manner 
which places the B. burgdorferi protein coding region downstream from the IPTG-inducible 
promoter and in-frame with an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E coli cells using standard procedures 

10 such as those described by Sambrook et al., supra.. £ coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing a B. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra). Transformants are identified by their ability to grow on 

15 LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from 

resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 jig/ml) and kanamycin (25 fig/ml). The O/N 
20 culture is used to inoculate a large culture, at a dilution of approximately 1 :25 to 1 :250. The cells 

are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-p-D- 

thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lacl repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

25 The cells are then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. The cell 

debris is removed by centrifugation, and the supernatant containing the B. burgdorferi 
polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column 
(QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity 
are purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, 

30 QIAGEN, Inc., supra). Briefly the supernatant is loaded onto the column in 6 M guanidine-HCl, 
pH 8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the B. burgdorferi polypeptide is eluted 
with 6 M guanidine-HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered saline 

35 (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein could be 
successfully refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 
mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over 
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a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 
mM immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 4° C or frozen at -80° C. 
The polypeptide of the present invention are also prepared using a non-denaturing protein 
5 purification method. For these polypeptides, the cell pellet from each liter of culture is 

resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer A = 50 mM Na-phosphate, 300 
mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 with 1 tablet of Complete EDTA- 
free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). 
Absorbance at 550 nm is approximately 10-20 O.D./ml. The suspension is then put through three 

10 freeze/thaw cycles from -70°C (using a ethanol-dry ice bath) up to room temperature. The cells 
are lysed via sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on 
ice. The sonicated sample is then centrifuged at 15,000 RPM for 30 minutes at 4°C. The 
supernatant is passed through a column containing 1.0 ml of CL-4B resin to pre-clear the sample 
of any proteins that may bind to agarose non-specifically, and the flow-through fraction is 

15 collected. 

The pie-bleared flow-through is applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag bind to the Ni-NTA 
resin with high affinity and can be purified in a simple one-step procedure. Briefly, the 
supernatant is loaded onto the column in Lysis Buffer A at 4°C, the column is first washed with 

20 10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the 
column is washed with 5 volumes of 40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) 
(Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 
500 mM Imidazole, pH of the final buffer should be 7.5). The protein is eluted off of the column 
with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to 

25 Buffer B. Three different concentrations are used: 3 volumes of 75 mM Imidazole, 3 volumes of 
150 mM Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein are analyzed using 8 %, 10 % or 14% SDS-PAGE depending on the protein size. The 
purified protein is then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it 
into an easily workable buffer. The purified protein is stored at 4° C or frozen at -80°. 

30 The following alternative method may be used to purify B. burgdorferi expressed in E coli 

when it is present in the form of inclusion bodies. Unless otherwise specified, all of the 

following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 

cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 

35 Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 

amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 
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The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, 
Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
5 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 

B. burgdorferi polypeptide-containing supernatant is incubated at 4°C overnight to allow further 

GuHCl extraction. 

10 Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 

GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 

The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 

purification steps. 

15 To clarify the refolded J?, burgdorferi polypeptide solution, a previously prepared 

tangential filtration unit equipped with 0. 16 jxm membrane filter with appropriate surface area 

(e.g., Rltron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 

20 1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 

effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 

25 Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 

acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant A^ monitoring of the effluent. Fractions containing the B. 

30 burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 

stained 16% SDS-PAGE gel when 5 \ig of purified protein is loaded. The purified protein is also 

tested for endotoxin/LPS contamination, and typically the LPS content is less than 0. 1 ng/ml 
35 according to LAL assays. 



3(b). Alternative Expression and Purification Borrelia polypeptides in E. 
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coli 

Tthe vector pQElO is alternatively used to clone and express some of the polypeptides of 
the present invention for use in the soft tissue and systemic infection models discussed below. 
The difference being such that an inserted DNA fragment encoding a polypeptide expresses that 
5 polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the amino 

terminus of that polypeptide. The bacterial expression vector pQElO (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 9131 1) was used in this example . The components of the pQElO 
plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present 
invention expresses the polypeptide with the six His residues (Le. t a "6 X His tag")) covalently 

10 linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 1 were 
amplified using PCR oligonucleotide primers from genomic B. burgdorferi DNA. The PCR 
primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a 
polypeptide of the present invention. Additional nucleotides containing restriction sites to 

15 facilitate cloning in the pQElO vector were added to the 5* and 3' primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3' primers were selected to 
amplify their respective nucleotide coding sequences. One of ordinary skill in the art would 
appreciate that the point in the protein coding sequence where the 5 1 and 3' primers begins may be 
varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present 

20 invention. The 5' primer was designed so the coding sequence of the 6 X His tag is aligned with 
the restriction site so as to maintain its reading frame with that of B. burgdorferi polypeptide. The 
3 1 was designed to include an stop codon. The amplified DNA fragment was then cloned, and the 
protein expressed, as described above for the pQE60 plasmid. 

The DNA sequences of Table 1 encoding amino acid sequences may also be cloned and 

25 expressed as fusion proteins by a protocol similar to that described directly above, wherein the 
pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 5371 1) is preferentially used in 
place of pQElO. 

The above methods are not limited to the polypeptide fragements actually produced. The 
above method, like the methods below, can be used to produce either full length polypeptides or 
30 desired fragements therof. 

3(c). Alternative Expression and Purification of Borrelia polypeptides in 
E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this example 
35 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 9131 1). However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His codons is prevented 
and, therefore, the polypeptide is produced with no 6 X His tag. 

The DNA sequence encoding the desired portion of the B. burgdorferi amino acid 
sequence is amplified from an B. burgdorferi genomic DNA prep the deposited DNA clones 
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using PCR oligonucleotide primers which anneal to the 5* and 3' nucleotide sequences 
corresponding to the desired portion of the B. burgdorferi polypeptides. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' and 3' 
primer sequences. 

5 For cloning a B. burgdorferi polypeptides of the present invention, 5' and 3' primers are 

selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art 
would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin 
may be varied to amplify a DN A segment encoding any desired portion of a polypeptide of the 
present invention. The 3* and 5' primers contain appropriate restriction sites followed by 

10 nucleotides complementary to the 5' and 3* ends of the coding sequence respectively. The 3' 
primer is additionally designed to include an in-frame stop codon. 

The amplified B. burgdorferi DNA fragments and the vector pQE60 are digested with 
restriction enzymes recognizing the sites in the primers and the digested DNAs are then ligated 
together. Insertion of the B. burgdorferi DNA into the restricted pQE60 vector places the B. 

15 burgdorferi protein coding region including its associated stop codon downstream from the DPTG- 
inducible promoter and in-frame with an initiating AUG. The associated stop codon prevents 
translation of the six histidine codons downstream of the insertion point 

The ligation mixture is transformed into competent E. coli cells using standard procedures 
such as those described by Sambrook et al. E. coli strain M15/rep4, containing multiple copies of 

20 the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 

("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing B. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra). Transformants are identified by their ability to grow on 
LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant 

25 colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 Jig/ml) and kanamycin (25 jig/ml). The O/N 
culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells 
30 are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequendy are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the B. burgdorferi polypeptide, the cells are then stirred for 3-4 hours at 4°C in 

35 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant 
containing the B. burgdorferi polypeptide is dialyzed against 50 mM Na-acetate buffer pH 6, 
supplemented with 200 mM NaCl. Alternatively, the protein can be successfully refolded by 
dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HQ pH 7.4, containing protease 
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inhibitors. After renaturation the protein can be purified by ion exchange, hydrophobic interaction 
and size exclusion chromatography. Alternatively, an affinity chromatography step such as an 
antibody column can be used to obtain pure B. burgdorferi polypeptide. The purified protein is 
stored at 4°C or frozen at -80° C. 
5 The following alternative method may be used to purify 5. burgdorferi polypeptides 

expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise 

specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the K coli fermentation, the cell culture is 

cooled to 4-1 0°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 

10 Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 

amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer (Microfuidics, 

15 Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
20 (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 

& burgdorferi polypeptide-containing supernatant is incubated at 4°C overnight to allow further 

GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
25 buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 

The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 

purification steps. 

To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0.16 \im membrane filter with appropriate surface area 

30 (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 
1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 
effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

35 Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 

volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
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Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 
acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
5 are collected under constant monitoring of the effluent Fractions containing the B. 
burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 

stained 16% SDS-PAGE gel when 5 ^tg of purified protein is loaded. The purified protein is also 
10 tested for endotoxin/LPS contamination, and typically the LPS content is less than 0. 1 ng/ml 
according to LAL assays. 

3(d). Cloning and Expression of B. burgdorferi in Other Bacteria 

B. burgdorferi polypeptides can also be produced in: B. burgdorferi using the methods of 
15 S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein Expr. Purif. 
8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. Microbiol. 
Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et al., U.S. Patent 
No. 4,952,508. 

20 4. Cloning and Expression in COS Cells 

A B. burgdorferi expression plasmid is made by cloning a portion of the DN A encoding a 
fl. burgdorferi polypeptide into the expression vector pDNAI/Amp or pDNAHI (which can be 
obtained from Invitrogen, Inc.). The expression vector pDNAI/amp contains: (1) an £. coli 
origin of replication effective for propagation in E coli and other prokaryotic cells; (2) an 

25 ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) an SV40 origin 
of replication for propagation in eukaryotic cells; (4) a CMV promoter, a polylinker, an SV40 
intron; (5) several codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate 
purification) followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and operably linked to 

30 the SV40 intron and the polyadenylation signal by means of restriction sites in the polylinker. 

The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described 
by Wilson et al. 1984 Cell 37:767. The fusion of the HA tag to the target protein allows easy 
detection and recovery of the recombinant protein with an antibody that recognizes the HA 
epitope. pDNAIII contains, in addition, the selectable neomycin marker. 

35 A DNA fragment encoding a B. burgdorferi polypeptide is cloned into the polylinker 

region of the vector so that recombinant protein expression is directed by the CMV promoter. The 
plasmid construction strategy is as follows. The DNA from a B. burgdorferi genomic DNA prep 
is amplified using primers that contain convenient restriction sites, much as described above for 
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construction of vectors for expression of B. burgdorferi in E. coli. The 5' primer contains a 
Kozak sequence, an AUG start codon, and nucleotides of the 5* coding region of the B. 
burgdorferi polypeptide. The 3' primer, contains nucleotides complementary to the 3' coding 
sequence of the B. burgdorferi DNA, a stop codon, and a convenient restriction site. 
5 The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 

appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an 

appropriate E. coli strain such as SURE™ (Stratagene Cloning Systems, La Jolla, CA 92037), 

and the transformed culture is plated on ampicillin media plates which then are incubated to allow 
growth of ampicillin resistant colonies. Plasmid DNA is isolated from resistant colonies and 

10 examined by restriction analysis or other means for the presence of the fragment encoding the B. 
burgdorferi polypeptide 

For expression of a recombinant B. burgdorferi polypeptide, COS cells are transfected 
with an expression vector, as described above, using DEAE-dextran, as described, for instance, 
by Sambrook et al. (supra). Cells are incubated under conditions for expression of B. 

15 burgdorferi by the vector. 

Expression of the B. burgdorferi-HA fusion protein is detected by radiolabeling and 
immunoprecipitation, using methods described in, for example Harlow et al., supra.. To this 
end, two days after transfection, the cells are labeled by incubation in media containing 35 S- 
cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed 

20 with detergent-containing RIPA buffer 150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% 
DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. (supra ). Proteins are precipitated 
from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The 
precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
product of the expected size is seen in the cell lysate, which is not seen in negative controls. 

25 

5. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of B. burgdorferi polypeptide in this example. 
Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37146). The 
plasmid contains the mouse DHFR gene under control of the SV40 early promoter. Chinese 

30 hamster ovary cells or other cells lacking dihy drofolate activity that are transfected with these 
plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life 
Technologies) supplemented with the chemotherapeutic agent methotrexate. The amplification of 
the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, e.g., 
Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. 

35 Acta, 1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target enzyme, 
DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR 
gene, it is usually co-amplified and over-expressed. It is known in the art that this approach may 
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be used to develop cell lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the 
amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse 
5 Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. (1985) Mol. Cell. Biol. 
5:438-447; plus a fragment isolated from the enhancer of the immediate early gene of human 
cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter 
are the following single restriction enzyme cleavage sites that allow the integration of the genes: 
Bam HI, Xba I, and Asp 718. Behind these cloning sites the plasmid contains the 3* intron and 

10 polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be 
used for the expression, e.g., the human 6-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVL Clontech's Tet-Off and Tet- 
On gene expression systems and similar systems can be used to express the B. burgdorferi 
polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. 

15 USA 89:5547-555 1 . For the polyadenylation of the mRNA other signals, e.g., from the human 
growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest 
integrated into the chromosomes can also be selected upon co-transfection with a selectable 
marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable 
marker in the beginning, e.g., G418 plus methotrexate. 

20 The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated 

using calf intestinal phosphates by procedures known in the art The vector is then isolated from 
a 1% agarose gel. The DNA sequence encoding the B. burgdorferi polypeptide is amplified using 
PCR oligonucleotide primers corresponding to the 5* and 3' sequences of the desired portion of 
the gene. A 5' primer containing a restriction site, a Kozak sequence, an AUG start codon, and 

25 nucleotides of the 5' coding region of the B. burgdorferi polypeptide is synthesized and used. A 
y primer, containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the B. burgdorferi polypeptides is synthesized and used. The amplified 
fragment is digested with the restriction endonucleases and then purified again on a 1% agarose 
gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 

30 K coli HB101 or XLr 1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five 
jig of the expression plasmid pC4 is cotransfected with 0.5 |xg of the plasmid pSVneo using a 

lipid-mediated transfection agent such as Lipofectin™ or LipofectAMINE.™ (LifeTechnologies 

35 Gaithersburg, MD). The plasmid pSV2-neo contains a dominant selectable marker, the neo gene 
from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. 
The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the 
cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
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MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 
10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Qones growing at the highest concentrations of methotrexate are then transferred to new 6-well 
5 plates containing even higher concentrations of methotrexate (1 pM, 2 |iM, 5 pM, 10 mM, 20 
mM). The same procedure is repealed until clones are obtained which grow at a concentration of 
100-200 \iM. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE 
and Western blot or by reversed phase HPLC analysis. 

6. Immunization and Detection of Immune Responses 

10 

6(a). B. burgdorferi propagation 

B. burgdorferi sensu stricto isolate B31 is propagated in tightiy-closed containers at 34°C 
in modified Barbour-Stoenner-Kelly (BSKII) medium (Barbour, A.G., Yale J. Biol Med. 
57:521-525 (1984)) overlaid with a 5%(y5%C(V90%N 2 gas mixture. Cell densities of these 

15 cultures are determined by darkfield microscopy at 400X. 

Immunization of Mice and Challenge with B. burgdorferi. For active immunizations 
B ALB/cByJ mice (BALB, Jackson Laboratories) are injected intraperitoneally (i.p.) at week 0 
with 20 g of recombinant borrelial protein, or phosphate-buffered saline (PBS), emulsified with 
complete Freund's adjuvant (CFA), given a similar booster immunization in incomplete Freund's 

20 adjuvant (IFA) at week 4, and challenged at week 6. For challenge B. burgdorferi are diluted in 
BSKII from exponentially-growing cultures and mice are injected subcutaneously (s.c.) at the 
base of the tail with 0.1 ml of these dilutions (typically 10M0 4 borreliae; approximately 10-100 
times the median infectious dose). Borreliae used for challenge are passaged fewer than six times 
in vitro. To assess infection, mice are sacrificed at 14-17 days post-challenge, and specimens 

25 derived from ear, bladder, and tibiotarsal joints are placed in BSKII plus 1 .4% gelatin, 1 3 g/ml 
amphotericin B, 1.5 g/ml phosphomycin, and 15 g/ml rifampicin, and borrelia outgrowth at two 
or three weeks is quantified by darkfield microscopy. Batches of BSKII are qualified for 
infection testing by confirming that they supported the growth of 1-5 cells of isolate B3L In 
some instances seroconversion for protein P39 reactivity is also used to confirm infections (see 

30 below). Others have previously shown that mice elicited antibodies to P39 when inoculated with 
live borreliae by syringe or tick bite, but not with killed borreliae (Simpson, W.J., et aL, J. Clin. 
Microbiol. 29:236-243 (1991)). 

6(b). Immunoassays 

35 Several immunoassay formats are used to quantify levels of borrelia-specific antibodies 

(HI ISA and immunoblot), and to evaluate the functional properties of these antibodies (growth 
inhibition assay). The HI ASA and immunoblot assays are also used to detect and quantify 
antibodies elicited in response to borrelial infection that react with specific borrelial antigens. 
Where antibodies to certain borrelial antigens are elicited by infection this is taken as evidence that 
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the bonelial proteins in question are expressed in vivo. Absence of infection-derived antibodies 
(seroconversion) following borrelial challenge is evidence that infection is prevented or 
suppressed. The immunoblot assay is also used to ascertain whether antibodies raised against 
recombinant borrelial antigens recognize a protein of similar size in extracts of whole borreliae. 
5 Where the natural protein is of similar, or identical, size in the immunoblot assay to the 

recombinant version of the same protein, this is taken as evidence that the recombinant protein is 
the product of a full-length clone of the respective gene. 

Enzyme-Linked Immunosorbent Assay (EUSA). The ELISA is used to quantify levels of 
antibodies reactive with borrelial antigens elicited in response to immunization with these borrelial 

10 antigens. Wells of 96 well microtiter plates (Immunlon 4, Dynatech, Chantilly, Virginia, or 
equivalent) are coated with antigen by incubating 50 1 of 1 g/ml protein antigen solution in a 
suitable buffer, typically 0. 1 M sodium carbonate buffer at pH 9.6. After decanting unbound 
antigen, additional binding sites are blocked by incubating 100 1 of 3% nonfat milk in wash 
buffer (PBS, 0.2% Tween 20, pH 7.4). After washing, duplicate serial two-fold dilutions of sera 

15 in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed 
three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After 
three washes, bound antibodies are detected with HjOj and 2,2-azino-di-(3-ethylbenzthiazoline 
sulfonate) (Schwan, T.G., etal., Proc. Natl Acad ScL USA 92:2909-2913 (1985)) (ABTS®, 
Kirkegaard & Perry Labs., Gaithersburg, MD) and A^ is quantified with a Molecular Devices, 

20 Corp. (Menlo Park, California) Vmax™ plate reader. IgG levels twice the background level in 
serum from naive mice are assigned the minimum titer of 1 : 100. 

6(c). In Vitro Growth Inhibition Assay 

Unlike other bacteria, borreliae can be killed by the binding of specific antibodies to their 
25 surface antigens. The mechanism for this in vitro killing or growth-inhibitory effect is not 

known, but can occur in the absence of serum complement, or other immune effector functions. 
Antibodies elicited in animals receiving immunizations with specific bonelial antigens that result 
in protection from borrelial challenge usually will directly kill borreliae in vitro. Thus, the in vitro 
growth inhibition assay also has a high predictive value for the protective potency of the borrelial 
30 antibodies, although exceptions, such as antibodies against OspC which are weak at in vitro 
growth inhibition, have been observed. Also, this assay can be used to evaluate the serologic 
conservation of epitope binding protective antibodies. A microwell antibody titration assay 
(Sadziene, A., et al., J. Infect. Dis. 167: 165-172 (1993)) is used to evaluate the growth inhibition 
(GI) properties of antisera against recombinant borrelial antigens against the homologous B31 
35 isolate, and against various strains of borrelia. Briefly, 10 5 borrelia in 100 1 BSKH are added to 
serial two-fold dilutions of sera in 100 1 BSKQ in 96- well plates, and the plates are covered and 
incubated at 34°C in a 5%O 2 /5%CO 2 /90%N 2 gas mixture for 72 h prior to quantification of 
borrelia growth by darkfield microscopy. 
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6(d). Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis 
(SDS-PAGE) and lmmunoblotting 

Using a single well format, total borrelial protein extracts, recombinant borrelial antigen, 
or recombinant P39 samples (2 g of purified protein, or more for total borrelial extracts) are 

5 boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, 
and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. 
Gels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of 
antibody to be tested for reactivity with specific borrelial antigens, followed by the appropriate 
secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm 

10 that the protein had transferred following electro-blotting, membranes are stained with Ponceau S. 
Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using 
ECL™ reagents (Amersham Corp., Arlington Heights, Illinois). 

6(e). Detection of Borrelia mRNA expression 
15 Northern blot analysis is carried out using methods described by, among others, 

Sambrook et al, supra, to detect the expression of the B. burgdorferi nucleotide sequences of the 
present invention in animal tissues. A cDNA probe containing an entire nucleotide sequence 
shown in Table 1 is labeled with ^ using the redrprime™ DNA labeling system (Amersham Life 
Science), according to manufacturer's instructions. After labeling, the probe is purified using a 
20 CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), according to manufacturer's 
protocol number PT 1200- 1 . The purified labeled probe is then used to detect the expression of 
Borrelia mRNA in an animal tissue sample. 

Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using 
ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol number 
25 PT1 190-1 . Following hybridization and washing, the blots are mounted and exposed to film at - 
70 C overnight, and films developed according to standard procedures. 

The disclosure of all publications (including patents, patent applications, journal articles, 
laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference 
in their entireties. 

30 The present invention is not to be limited in scope by the specific embodiments described 

herein, which are intended as single illustrations of individual aspects of the invention. 
Functionally equivalent methods and components are within the scope of the invention, in 
addition to those shown and described herein and will become apparent to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to 

35 fall within the scope of the appended claims. 

Provisional Application Serial No. 60/057,483 filed 3 September 1997 is incorporated by 
reference herein in its entirety. 
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TABLE 1 . Nucleotide and Amino Acid Sequences 
flOl.aa 

MSKIFLLFNAGFFFLKIIYWSYPEIKNFSRQDP 
WQSGIATINIJ^DSPAVSVGQEILIPSKKGVFW^ 

DFGLFSGTELLFFLNANFIFPLKKFIVSSDFGFRNDPFTGNKSFHTGIDIjAAPMNA 
tlOl.aa 

SYPEIKNFSRQDPVFSDLKIKVLKYNKKQHI PLFFYSYKVKKGDTFFKI ANKINGWQSGIATINLLDS PAVSVGQE 

ILIPSKKGVFWDSKDYRFNNLI^TRD^ 

LKKFIVSSDFGFRNDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 



flOl.nt 

ATGAGTAAAATTTTTTTATTATTTAA 

TAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTC 

TATTCCrcTGTTTTTTTACTCATATAAAGTTAA 

TGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGC 

GTAAAAAAGGAGTTTTTGTTTTTGATAGTT^AAGATTATAGA 

TAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTO 

GATTTTGGACTTTTTTCAGGCACAGAATT^ 

TTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTT^ 

AGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 



tlOl.nt 

TCTTATCC^GAAATAAAAAATTTCTCAAGGCA 

ACAAAAAACAACATATTCCTCTGTTTTTTTACTCATATAAA 

TAAAATAAATGGATGGCAGTCCGGCATTGCTACTA 

ATTCTTATTCCCAGTAAAAAAGGAGTTTTTGTTTTTC 

CAAGGGATCTTGCTAAAGCTGAAAAGGTAAAAATO 

TGTTAAGAATCCAGATTTTGGACT^ 

TTAAAAAAATTTATTGTTAGTTCTGATTT^ 

GAATAGATCTTGCAGCTCCAATGAATGCTGAAGTGTATCT 

fll.aa 

VKKYIKTIFLISMVYFYCCTC^ 

NDKIVLKKEDLTIIONETGYKNKYREFFIGPKTSFKFKVYPLKI^ 
TINVLITRTTKINITNK 

tll.aa 

CCTTIKINHDYETDFK\^ESPSKYINIDVI 
GYKNKYREFFIGPKTSFKFKVYPLKIHSKNK^^ 

fll.nt 

GTGGAAA7VATTTCTTTTATTCCAGGAAATGAAAATATO 
TCAAAAAATACATAAAAACAATATTTCTGATTTCAATGGTTO 
TGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCT 
GAATATATTTATATTCAAATTACAAACAATA^ 

ACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACAGGGTATAA 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

TTTTATTGGTCCTAAAACTTCATTTA 
AACTTAAGCTCAACTATTAAATATCCGTCTATTTT^ 

CAATAAATGTTTTAATAAC^GAACTACAAAAATTAATATTACTAATAAAT^ 
tll.nt 

TGTTGTACGACAATAAAAATAAACCATGATTATGAAACTGATTTT 

ATATAGATGTAATTAAAGCTACAAATGAATATATTTATATTCAAATTACAAAC^ 

AAATTGGCAAAACACTAGTCTTAACAACGATAAGATC^ 

GGGTATAAAAATAAATAC^GAGAGTTTTTTATO 

TTCATTCTAAAAACAAAAATAGCAATAACTTAAGCTCAAOT 

AAAAGTAGGAATTGAAGCAAAAAAAACAATAAATGTTTTAATAAC^ 

TGA 

fl2.aa 

MREFLYRNVFKKSFIVFLIFLTFSNAIFAQTIDDEBISKK^ 

SKIRELLLKKFGIDPELFIKGKGLAGSGRYKI I IETADNLEWFTYGLTKDESIIFEGRVNILVEDIKENKKHNIKG 
DRIVLNKNSKKLYAIGNVEYI LDMDTNEKLYFYGNEFLVDFDSQNFLLiKNGIIiQKKMQKNQIDHI LSFGGKVLKKI 
DNDVTILEQAFATTSKI PEPYYSIKASKIWALPSGDFGFLNAIFYMGRVPVFYI PFFFRPGDSLFFNPSLGLNPRK 
GFSVFNTVYLFGNKSSSEDSSFLDFDFNSVYNSGKKPYIRNGYLTYFFAENIA 
IDFNLGNTLGHFKTLEGNFGI^FTRNVYSYIXX3YYPFDNRT1^ 
SDALFSVVLEHYSDPYVNIDFRDRIESATFFSLLNLDKDSVKEQTSISTFDWNLSS 

LSFKLSGYENLYVKS PLEKPKDVNDPTRKWFYLERIYAPYI DLNFQKDLYNNQWTFPADTKEMIMRPEIKNLEDKD 

NDKKSVKEKNTKKTTELTKDLYIPPEPITLKNIDQSDSFFIRFGINPYLRNNVFFD 

IKNKTDIKIHADFYNRLITFENLLYL^ 

NKATLYSFNKKYDSDVKSLVNKNSS I FLSDPETFYQSLTASLI YDYDYFTTELSGELKNSFEDI KASSELKLSLDF 

PYLLQEAGIGIKYYKKFKEDAMKNSGISAVQS PLEPQKPS SPYKNLEMS PALYYKIEPRYLDYFKFSFLVAYDPLI 

NRVSELSFKLNVFDFQFLFAMKDDFEYNYDPLKGDFSM^ 

GWKINLQKFTDNELRSALTLKFKYTEFLEIYFST^ 

DSLFKIKKFSSGFKFNFYDWKFVGEYNLEPDLLRGSIXjIYSPI^ 

NRKTKK 

1 12 . aa 

IFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDVS 

sgryki 1 1 etadnlenftygltkde s 1 1 fegrvni lvedikenkkhnikgdri vlnknskklyai gnveyi ldmdt 
nelklyfygneflvdfdsqnfllkngilqkkmqkn^ 

askiwalpsgdfgflnai fymgrvpvfyi pfffrpgdslffnpslgij^rkgfsvfntvylfgnksssedssfldf 

dfnsvynsgkkpyirngyltyffaenlapsvnkdyw 

nvysydggyypfdnrtlkqslfsfsnlnkgdwgfevpfrylfkfkt 

esatffsllnldkdsvkeqtsistfdwn^ 

ptrkwfyleriyapyidlnfqkdlynnqwtfpai^^ 

epitlknidqsdsffirfginpylrnnvffdnygitspro 

lntieysplnkdfkvedkdkksehsiinqinll^ 

iflsdpetfyqsltasliydydyfttelsgelknsfedikasselklsldfpyllqeagigikyykkfkro 
gi savqsplepqkps spyknlemspalyykiepryldyfkf 
eynydplkgdfskigtttklvpysldssykkelyvltffdnklsftlgvdv^ 
efleiyfstlsintktfkyfkgymtoiglefvnffto^ 

YNLEPDIJLjRGSIXalYSPIWRNNFTIYISWN^ 
fl2.nt 

atgcgagaattcctatacaggaatgttttt^ 

c^tttttgcccagactatagatgatgaaaat^ 

aagagaacttgagctttcaaccgatgaggattt^ 
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TCAAAAATACGAGAATTGCTT^ 

GATCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTC 

AAGTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAG^ 

GACAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATG 

CCAATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTC 

TATTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCAT^ 

GACAATGATGTTACCATTTTGGAACAAGCTTTTGC^ 

CTTCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTT^ 

ATTTTATATTCCTTTTTTTTTCAGACCGGGAGATA 

GGTTTTTCTGTTTTTAATACCGTTTATCTTTT^ 

ATTTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATA^ 

TTTAGCACCCAGTGTTAATAAAGATTATGTTAA 

ATTGATTTTAATTTGGGCAATACTTTGGGGCATTOT 

ATGTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTT^ 

TCTTAACAAAGGAGATGTATTTGGGTTTGAAGTTC 

AGTGATGCACTTTTCTCGGTTGTTTTAGAGCAOT 

AAAGTGCTACATTTTTTTCTCTTTTAAATTTA^ 

TTGGAATTTATCTTCTTTTTATAAGCGAACATTO 

TTAAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAA 

CTACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCA 

TAACCAATGGACATTTCCAGCTGATACTAAAGAAATGAT^ 

AATGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAAGAGAATO 

AACCAATTACTTTAAAAAATATTGATCAATCC 

TAATGTTTTTTTTGATAATTATGGCATAACAAG 

ATAAAAAATAAAACGGATATAAAAATTCATGCTGATTT^ 

TTAATACTATTGAGTATAGTCCTTTAAATAAAGATTTT^ 

TATTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATC 

AATAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGA 

TTTTTTTATCTGATCCGGAAACTTTTTATCAAAGTOT 

TGAGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGAT^ 

CCTTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAA^ 

GAATTTCTGCTGTTCAAAGTCCTTTGGAGCCTC 

TTTGTATTATAAAATTGAGCCGAGATATTTGGATTATTTT 

AATAGAGTTTCTGAACTTTCTTTTAAGCTTAATGT^ 

AATATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACT 

TTCTAGTTACAAAAAGGAATTGTACGTTTTAACTTTCT 

GGTTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTC 

AATTTTTAGAAATTTACTTTTCTACTTO 

AATTGGTOTAGAACCrrcTTAATTTCTTTGTTGA 

GATTCACTTTTTAAAATTAAAAAATTTTCATCA 

ATAATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTO 

TATTTCTTGGAACTTTTTTGCTCCTATAA^ 

AATAGAAAAACAAAAAAATAA 

tl2.nt 

ATTTTTGCCCAGACTATAGATGATGAAAATTCTAA 

GAGAACTTGAGCTTTCAACCGATGAGGATTTAAAAAAATGGGCC^ 

AAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCT^ 

TCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAA^^ 

GTATTATTTTTGAAGGAAGAGTTAATATCTTGGTT^ 

CAGAATAGTCCTTAATAAGAACTCTAAAAAAC 

AATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTC 

TTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATC^ 

CAATGATGTTACGATTTTGGAACAAGCTTT^ 

TCTAAAATATGGGCATTGCCCTCGGGAGATTTO 



WO 98/59071 



PCT/US98/12718 



59 

TABLE 1 . Nucleotide and Amino Acid Sequences 

TTTATATTCCTTTTTTTTTCAG 

TTTTTCTGTTTTTAATACCGTTTATCTTTT^ 

TTCAATTCTCTTTATAATTCGGGT^ 

TAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTQACA 
TGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAAC 

GTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATC 

TTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTO 

TGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCC 

AGTGCTACATTTTTTTCTCTTTTAAATO 

GGAATTTATCTTCTTTTTATAAGCGAACATTTAATGAC^ 

AAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATC 

ACAAGAAAATGGTTTTATTTGGAGAGAATTTATC 

ACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCA 

TGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACT^CAGAATTAACCAAAGATTT 

CCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATO 

ATGTTTTTTTTGATAATTATGGCATAACAAGTC 

AAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAA 

AATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAA 

TTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATC 

TAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATC 

TTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAAC 

AGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCT^ 

TTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTA^ 

ATTTCTGCTGTTCAAAGTCCTTTGGAGCCTC 

TGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTT 

TAGAGTTTCTGAACTTTCTTTTAAGCTTAATC 

TATAATTATGATCCTTTAA/^GGAGATTTTTCCAAGATTGGTAC 

CTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTG^ 

TTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTC 

TTTTTAGAAATTTACTTTTCTACTTTATC 

TTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCA 

TTCACTTTTTAAAATTAAAAAATTTTCATCAG 

AATTOAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTC 

TTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTC 

TAGAAAAACAAAAAAATAA 

fl29.aa 

MTKKLFVRVLIFLISNNYAFAKOTIKDLFFIQDILIKKECT 
NGYRRNFNFFNLUSPTSNIIKSLSLFDSRPKNIKENEIILLETKMIKE^ 
DFNFLFDQRKTFPS I YIKEEDVSTI INSFMKLQDSSFLSPQAS 

tl29.aa 

KDTIKDLFFIQDILIKKEKYSEVLNNASLEGIIEIEHNGPYIKDHDSEVK^ 

LSLFDSRPKNIKENEIILLETKMIKENPTC^ 

VSTI INSFMKLQDS SFLSPQAS 

fl29.nt 

ATGACAAAAAAATTGTTTGTGAGGGTATTAATCTTTOT 

AAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAAT^ 

AGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGAT^ 

AACXX^TATAGAAGAAATTTCAACTTTTTTAATC 

ACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGA 

ACGATACAAAGACGATGATGATTTTGAATTAA^ 
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GATTTCAATTTCCTATTTGATC 
TAAACAGCTTCATGAAACTACAAG 

tl29.nt 

AAAGACACAATCAAAGATTTGTTCTTTATACAA 

ATGCAAGCCTTGAAGGCATTATTGAAATTGAAC^ 

TATCCTAAAAGAAAACGGATATAGAAGAAATTTCAACTTTTTTAATC 

CTAAGCTTATTTGACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGAC 

AAAATCCCTATAAACGATACAAAGACGATGATGATTTTGAATTAAAACTAAGTGTAACTC 

TTATTTAATTCTTGATTTCAATTTCCTATTTGATC 

GTATCAACAATAATAAACAGCTTCATGAAACTACAAGATTCAAGCTTTT^ 
fl42.aa 

MDKISILYTLINIIIMLILISIWLCKRK^ 

VRLLKMI 1 1 PLI ITSI ISAI IKLTNSKDVGKMSLLVILTLVFTAGIAAI IGIFTALALGLTAEGLQAGTIEILQSE 

KLQKGLEILNQTTITKKITDLIPQNIFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKP 

IIXSVVTLILKLTPYAIIALOT^ 

FAFISRSSAATIPINIEIQTKNIX5VSEGIANLSSSFGTSI^ 

LIIITSFGAAGAGGGATTASLMVLSAMNFPVGLVGLVI 

QKELVNK 

tl42.aa 

CKRKNVSFTKRWLALAIGIWGMTIQYFYG™^ 

SKDVGKMSLLVILTLVFTAGI AAI IGIFTALALGLTAEGLQAGTI E I LQSEKLQKGLEI LNQTTITKKITDLI PQN 

IFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPESIEFFKKIIL^ 

TSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAI 

SEGIANLS S SFGTS IGQNGCAALHPAMLAIMIAPTQGINPTDI SF ILTLIGL 1 1 ITSFGAAGAGGGATTASLMVLS 
AMNFPVGLVGLVT SVEPI IDMGRTAVNVGGSMLAGVI SAKQLKQFNHNI YNQKELVNK 



fl42.nt 

TAAGAGGTAATAATGGATAAAATAAGTATATTATA^ 

TAGTTTATCTTTGTAAAAGAAAAAATGTTTCTT 

TGGAATGACCATTCAATATTTTTAT^ 

GGCGATGGATACGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCA^ 

AACTAACCAATAGTAAAGATGTTGGGAAAATGAGCCT^ 

TGCCATAATTGGCATTTTCACTGCTTTAGCATTGGGACT 

TTACAAAGTGAAAAATTGCAAAAAGGCCTTGAAATATC^ 

TTCCACAAAATATATTTGAAGATTTTGCAGG 

CATAGGAATAGCCGCCCTTAAAACATCTATCAAAAAGCCAGAATC 

CTCCAAGACATAATATTAGGTGTAGTAACTTTGATTTTAAAAOT 

AAATTACAGCAACCAGCGAAATOVAAAGC^TAATAAAGCTTGGAGAATTT 

TCTTACATTTCTTATGCATATGACATTAATTGCAATAAATAAATTA 

CCAGCACTATCATTTGCATTCATATCTAGGTCGAGTGCTG 

ATCTGGGAGTAAGCGAAGGAATAGCAAATTTATC 

ACTACACCCCGCTATGCTTGCAATAATC^ 

ACACTTATTGGATTAATAATAATAACTTCATTTGGAGCTGC 

TGGTGCTCTCAGCAATGAACTTTCCAGTGGGATTG^ 

AAGAACAGCTGTTAATGTAGGCGGCTCAATGCTTGCAGGCGT^ 

AATATATACAACCAAAAAGAGCTTGTAAACAAATAA 



t!42.nt 
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TGTAAAAGAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCG 
TTCAATATTTTTATGGAACAAATTCAGAAATAA 

CGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATA 

AGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATT^ 

GCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTAC^ 

AAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCA 

ATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTC 

CCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAA 

AATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTA 

ACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGT^ 

TTATGCATATGACATTAATTGCAATAAATA^ 

ATTTGCATTCATATCTAGGTCGAGTGCTGC^^ 

AGCXSAAGGAATAGCAAATTTATCAAGCTCCT^ 

CTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTC 

ATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCG 

GCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATC 

TTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAA^ 

CCAAAAAGAGCTTGTAAACAAATAA 

fl47.aa 

MKI 1 1 1 GGTSAGTSAAAKANRLNKKLDITI YEKTNIVSFGTCGLPYFVGGFFDNPNTMISRTQEEFEKTGISVKTN 

HE\nCKVDAKNOTIVIKNQKTGTIFNNTYDQLMIAT^ 

VIIGGGYIGIEMVEAAKNKRKNVRLIQLDKHILIDSFD^ 

NKNTYQADAVILATGIKPDTEFLENQLKTTKNGAI IVNEYGETSIKNIFSAGDCATI YNIVSKKNEYI PLATTANK 
IjGRIVGENIjAGNHTAFKGTLiGSAS IKI LSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYE 
ENTKIILGAQAIGKNGAVIRIHALSIAIYSKLTTKEIX5MMDFSYSPPFSRTWDILNIAGNAAK 

tl47.aa 

AAAKANRLNKKIJ)ITIYEKTNIV^ 
IKNQKTGTIFNNTYI^I^IATGAKPIIPPINNI^ 

AAKNKRKNVRLI QLDKHILIDSFDEEIWIMEEELTKKGVNLHTNEFTO LAT 

GIKPDTEFLENQLKTTKNGAIin/NE^ 

AFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYTCT^ 

NGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

fl47.nt 

ATGAAAATAATAATTATTGGGGGCACATCAGCAGGAACTAGTGCCGCAGCTA 

TAGACATTACTATCTATGAAAAAACAAATATTC^ 

CTTTGACAACCCCAATACAATGATCTCAAGAACAC^ 

CACGAAGTTATCAAAGTAGATGCAAAAAACAATACAA 

ATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTATTATTC 

TTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAAAAA^ 

GTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAAGCAG^ 

TTCAACTAGATAAGCACATACTCATAGATTCCTTTGACGAAGAAATAGTCACA 

AAAGGGGGTTAATCTTCATACAAATGAGTTTGTAAAAAGTTO 

AACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTA^ 

AGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGTATGGC^ 

AGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAATGAATA 

CTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACAGCAT^ 

AAATACTATCTTTAGAAGCTGCAAGAAC^SGACTTACAG 

GATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCA 

GAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAAAATGGAGCCGTAA^ 
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TTGCAATCTATTCAAAACTTACAAGAAAAGA 
TTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 

tl47.nt 

GCCGCAGCTAAAGCAAACCX^TTAAACAAAAAGCTAGACATTACTATCTATG 

GAACCTGTGGCCTGCCTTACTTTGTGGGGGGATTCTTTGACAACCCCAATACAATG 

ATTCGAAAAAACTGGAATCTCTGTTAAAACTAACCACGAAGTTAT 

ATAAAAAATCAAAAAACAGGAACCATTTTTAACAATACTTACGATCAACT^ 

TTATTCCACCAATCAATAATATCAATCTAGAAAA 

AAAATTAATGGATAGAGAAGAGATTAAAAATATAGTGAT^ 

GCAGCAAAAAATAAAAGAAAAAATGTAAGATTAATTCAACTAGATAAGCACATACTC 
AAATAGTCACAATAATGGAAGAAGAACTAACAAAAAAGGGGGTT 

AATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACAAACAAAAATACTTATCAAGCTGACGCTC 

GGAATAAAACCTGACACTGAATTTTTAGAAAACCAGCTT^ 

ATGGCGAAACTAGCATAAAAAATATTTTTTCTGCAGGAGATTC 

TGAATACATACCCTTGGCAACAACAGCCAA 

GCATTTAAAGGCACATTGGGCTCAGCTTCAATTA 

AAGATGCAAAAAAGCTCCAAATAAAATATAAAACGATTTTTGTAAAGGACAAAAATCATACA 
CCAAGAAGATCTTTATATTAAATTAATTTATGAGGAAAATACCAAAA 
AATGGAGCCGTAATAAGAATTCATGCTTTATCAATTGCAATCTATTCAA 
TGGATTTCTCATATTCCCCACCCTTCTCAAGAA^ 

fl52.aa 

MLKFEFSDRFLLFSYFVLIMFIGSLIJ^LPISWEGIXSKLAYIDALFTAVSAVSITC 

IQLGGLGFISITTFYLLIPKKKMNLTDARIIKQYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRG 
SFLEALFTTI SAFCNAGFSMHSES I YAWRDVPEAIWVS I L 1 1 CGGLGFMVYRDVl^TI KNKKKLSLHAK I VFS LS 
FFLI I IGAI LFFFTEMHKLKAGYSMSTLIFNS I FYS ISTRTAGFNYLDNSLI SGRTQI I SLPFMFIGGAPGSTAGG 
I KITTFFLI VLAWKNQNGNG YI IGS YKVS IDS IRFALLFFARAI F I LSFSFFMLLFFEGGSGNWKVT DLG YEVF S 
AFGTVGLSVGVTQDLSFWGKVI 1 1 FTMFAGRIGLFSMAVFVSRKSRFEEFTRPRQDI LVG 

tl52.aa 

WEGDGKIAYIDALFTAVSAVSITGLTTVKMEGFSTFGFI^^ 

QYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFK^ 

EAIVWSILIICGGIiGFMVYRDVNNTIKNKKKLSLHAKIVFSLSFFLIIIG^ 

IFYS I STRTAGFNYLDNSLI SGRTQII SLPFMFIGGAPGSTAGGIKITTFFLIVLAVVKNQNGNGYIIGS YKVS ID 
SIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFSAFGWGLSVGVTQDLSFWGOT I IFTMFAGRI 
GLFSMAVFVSRKSRFEEFTRPRQDILVG 

f!52.nt 

ATGTTGAAATTTGAATTTAGCGACAGGTTTTT 

TGTTGATGTTGCCTATTTCCTGGGAAGGTGATGGCAAATT 

TGTAAGTATTACGGGCCTTACAACGGTTAAAATGGAA 

ATCCAGCTTGGGGGACTTGGATTTATAAGTATTACTACTTTTTA 

CAGATGCAAGAATAATAAAGCAGTATTCCCTTTCAAATATAGAATATAATC 

GTTTATAACTTTTTCAATTGAAATGATAGG 

TCATTCTTAGAGGCTTTGTTTACGACAATTTC 

ATGCATGGCGAGATGTTCCTGAAGCTATAGT^ 

CTATAGAGATGTAAATAACACTATTAAAAACAAAAAAAAACTATCGCTTC 

TTCTTTTTAATTATAATTGGTGCAAT^ 

GCACTTTAATATTTAATTCAATTTTTTATTCGATTAGTAC 

AATAAGCGGAAGAACTCAAATAATTTCTCTACCATTCATGTTTAT^ 

ATTAAGATTACAACATTTTTTTTAATTGTATTGGCTG 
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CTTACAAGGTTTCAATAGATAGTATAAGATTTGCACTTTO 

TTTTTTCATGCTTCTTTTTTTTG^ 

GCTTTTGGAACGGTTGGTCTTTCAGTTGGAGT 

CTATGTTTGCAGGACGAATAGGGCTTTTTTCAATGGCTGT^^ 

AAGGCCAAGGCAAGATATTTTGGTTGGTTGA 

tl52.nt 

TGGGAAGGTGATGGCAAATTAGCATACATTGATGCTC 

CAACGGTTAAAATGGAAGGCTTTTCTACTTCT 

ATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAA 

GAGTATTCCCTTTCAAATATAGAATATAATCCT 

AAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTO 

TACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTC 

GAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTG 

CTATOAAAAAGAAAAAAAAACTATCGCTTCATC 

TGCAATTTTATTTTTTTTTACAGAGATGCA 

ATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAAOT 

TAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATC 

TTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATAT^ 

AGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATT^ 

TTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTA 

TTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTC^ 

GGGCTTTTTTCAATGGCTGTTTTTGTTTCA^ 

TGGTTGGTTGA 

fl54.aa 



MKINKTFII^FLFTKFSFVQAQANQILTEISPLSIL^ 
KYITDKVKRRDFKIRLDKNSLYAIIWGTKNE^ 

AKLKKYVLRIYKSNIYIAYQLENSDDIKVAEFIEDVGWFNLDSSVNRNITO 

FASELIVKKFNSRKWIDI S PGHI ENFGSLLNI S IDLKDRLYLAYLREIRGEYKINLI SNMGYGS IWTDVIHAYLSK 
GDSNVNSSNIGLISEPFLGIFYNYKSNNEIKSEFIVNNENAWVNAN 
PIWICPLKSSRWINISPNVEMEGLSADIGLYKNNLFLAFEDNNNVRLIW 
YGNQGLVISTLSSNSNELFFTLICQ 

tl54.aa 

NQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNKKYITDKVKRRDF 
IIYVGTKNENIKFSLTDIJ)FSILSSDSLKAKTSK^ 

SDDIKVAEFIEDVGWFNLDS SVNRNITNXVNFDFS INSKGNLYI AFVTKSGADFASELIVKKFNSRKWIDI S PGHI 
ENFGSLI^ISIDLKDRLYIAYIiREIRGEYKINLISNMGYGSIWTO 

YKSNNEI KSEF I VNNENAWVNANI PSVYMANFI KGFFDSNFNQI IMSFVSENRPI VNICPLKSSRWINISPNVEME 

GLSADIGLYKNNLFLAFEDNNNVRLIYFKNKNWYFLNKLEOT 

CQ 

f!54.nt 



ATGAAAATAAATAAGACATTCATTTTGCTATTTTTATl^ 

TATTAACAGAAATTAGTCCTTTAAGTATT^ 

TTCCGATTATATTTTAACCCTAGATAAGAGTTC^ 

AAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAA 

ATGTTGGTACTAAAAATGAAAACATAAAGTTITCGCTTACAGATTO 

GAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTT^ 
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GCCAAGCTTAAAAAATATGTATO 

ATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTT 
TATAGTTAATTTTGATTTTTCAATTAATTCT 

TTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGA 

TTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGT^ 

ATATAAAATTAATTTAATCTCGAATATGGGTTAC 

GGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATArc 

CAAATAkTGAGATTAAATCTGAATTTATTGT 

TATGGCCAATTTTATTAAAGGCTTTTTTGATTC 

CCTATTGTAAACATTTGTCCTTTGAAAAG 

GTGCTGACATTGGGCTTTATAAAAATAATTTGTT^ 

TAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTT^ 

TATGGGAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGA 

GA 

tl54.nt 



AATCAAATATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGG 
GCAAATCTTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCC 

TAATAAAAAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTC 

ATAATATATGTTGGTACTAAAAATGAAAACATAAAGTTT^ 

ATTCCCTGAAAGCTAAAACATCTAAGATTGA 

TTTAACTGCCAAGCTTAAAAAATATGTATTAAGGA 

AGCGATGATATTAAAGTTGCTGAATTTATTGAGGATG 

TTACTAATATAGTTAATTTTGATTTTTCAATTAATTC 

GGCTGATTTTGCCAGCGAGCTTATAGTTAAAAAATCT 

GAAAATTTTGGATCTTTATTAAATATTAGCATTGATTTAA 

GGGGTGAATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACC 

AAGTAAAGGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATC 

TATAAGTCAAATAATGAGATTAAATCTGAATTTATTGT^ 

CTGTTTATATGGCCAATTTTATTAAAGGCTTTTTTGA 

AAATAGACCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGAT^ 

GGTTTAAGTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTO 

TTTATTTTAAGAATAAAAATTGGTATTTT^ 

TGGAATTTATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTC 

TGCCAATGA 



f!57.aa 



MKIFLKVTGRGILGRLMVFRKNYDYLALISLLrVSFVGILLIY^ 

KYDLKFVYSMVYPLYFLLI LAL I FTAFFGMTWGARSWIGIWKLGGQPSEFGKVVI I LTLSKFYTEKKGYNEFFTF 

ITAFLLIFPSVILILLQPDFGTAIVYLTIFIFISFFAGI^^ 

IFSNPFYFRVIMGVLLLILLISVLGFFISKYGLSIKIIYFYWF^ 

PAIDAKGAGWNLNQVKIAIGSGGLLGKGFLKGPYTHAOT 

IIMNKSQDRYMALVISGILGLLFFHTSFNVGMSIX3^ 



tl57.aa 



RKNYDYIJOilSLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWIIGFFLIFIVGKYDLKFWSM 
lALICTAFFGMTVNGARSWIGIWKLG^ 

FGTAIVYTjTIFIFISFFAGIDLHYVLAFALIGFFSFVFAILPVWY^ 
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TABLE 1. Nucleotide and Amino Acid Sequences 

LI SVLGFFI SKYGLS IKI I YFYVFFAS S I LLVS I VFSKVLSKLMKTY QIKRFLVFLDPAIDAKGAGWNLNQVKI AI 
GSGGLLGKGFLKGPYTHANWPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFLIIMNKSQDRYMALVISGIL 
GLLFFHTSFNVGMSIX5VLPITGI PFPFLSYGGSSTITFFtAMSF YFNIES IVAMD 



fl57.nt 



ATGAAGATATTCTTAAAGGTTATAGGCCGTGGTATACT 

TGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTT^ 

ATCTTTAACCAAGAATGAATATATAAAACAAACCT^ 

AAATATGATTTAAAATTTGTTTATAGCATG^ 

CATTTTTTGGAATGACAGTAAATGGAGCAAGLA/TC 

TGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGA 

ATTACTGCATTTTTATTAATTTTTCCATCGGTAATTC 

ATTTAACCATTTTTATATTTATTTC 

GTTTTTTTCTTTTGTTTTTG 

ATTTTCTCAAATCCTTTTTATTTT^ 

TTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTO 

TTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATC 

CCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAG^ 

GCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCA 

TCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGC^ 

ATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAA^ 

CTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTA 

TTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTT 



tl57.nt 



AGAAAAAATTATGATTATTTGGCTTTGATAAGCCT 

GCGATTATAATATTAGTGGATCTTTAACCAAGAATGAATATATAAAACAAAC 

TCTAATTTTTATAGTGGGCAAATATGATTTAAAATTTGTT^ 

TTGGCTTTAATTTTTACTGCATTTTT^ 

GAGGACAGCCTTCTGAATTTGGTAAA^ 

TAATGAATTTTTTACCTTTATTACTGCATT^ 

TTTGGTACAGCAATAGTATATTTAACCATTTTTATATTTATTTC 

TAGGATTTGCGTTGATAGGGTTTTTTTCTT^ 

GGGTAATGTATTTTATCTTATTTTCTCA^ 

TTGATTTCTGTTTTAGGATTTTTCATT^ 

CAAGTTCTATTTTATTAGTTTCAATAGTGTTTTC 

GTTTTTGGTATTCTTAGATCCGGOTATTGATGCTAA 

GGTTCTGGCGGTCTTTTGGGCAAAGGATTTTO 

CAGATTTTATTTTTTCTATTCTTGCC 

CCTTTTTTTTAAATTTTTGATAATAATG^ 

GGACTTTTATTTTTTCATACTTCTTTTAATGTTGGAATC 

CTTTTCTCTCTTATGGAGGTTCCT 

AGTTGCTATGGATTGA 



fl7.aa 



MIWLFFSIYLIILFKRS5^SPLYFVPDTKFETLSIRIVLSC^ 
FLLKSIFVRVLIST^SLPTKGSNFLAFASAVKFLTYFPISKCSFSSRISSSNSL 
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tl7.aa 



PLYFVPDTKFETLSIRIVLSCSLLLIFFCT^ 
NFLAFASAVKFLTYFPISKCSFSSRISSSNSL 



f!7.nt 



ATGATTGTGTTTTTGTTTTTTTCAATA 

TTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTC 

TATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCA 

TTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTC^ 

TTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATT^ 

TTCTTTGTAG 



tl7.nt 



CCTCTATATTTTGTTCCCGATACCAAGTTTGAA^ 

TTTTTTTTTGCACTATGCTTGATGCAAGGCCTTCAACT^ 

TGCACTATTTTTATTTCTTCTCAAGAGTATATTTGTAAGAGTTT^ 

AATTTTTTGGCTTTTGCAAGTGCTGCT^ 

TTTCTTCATCAAATTCTTTGTAG 



fl70.aa 



MKAFKVKNLRRFSNF IRI LVIVLFLNSLLSLFVFLAGSYNI FVYNFQKFYLDLAI ILSSVSFGLESTRLI FFYFLK 
NKKIKYYLILIFSFI IFFIALVFKIFLSGNK 



tl70.aa 



YNIFVYNFQKFYLDLAIILSSVSFGI^STRLIFFYFLKNKKIKYYLILIFSFIIFFIALW 



fl70.nt 



ATGAAAGCTTTTAAAGTAAAAAATCTAAGACGTTTTTCAAATTT^ 

ATTCTTTGTTAAGTTTGTTCGTGTTTTTGGCTGGTTC 

TGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTG^ 

AATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAGTTTTATA^ 

TTCTTTCTGGTAATAA 

ATAG 



tl70.nt 



WO 98/59071 



PCT7US98/12718 



67 

TABLE 1. Nucleotide and Amino Acid Sequences 

TACAATATTTTTGTTTACAATTTTCAGAAATTTTATC 

TTGAATCTACTAGACTGATATTTTTTT 

TTTTATAATTTTTTTTATTGCTCTTGTT^ 



f!86.aa 



MKKLI 1 1 FTLFLSQACNLSTMHKIDTKEDMKI LYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTL 
FGTTPMQRI HKYDQS FNLTRE I LASGI ELNRWNAWLNS PSHKEAL INTDTDKI GGYRLKTTDNIDI FWLFGKRK 
YKN 



tl86.aa 



TMHKIDTKEDMKI LYSEI AELRKKLNLNHLE IDDTLEKVAKEYAIKLGENRTITHTLFGTTPMQRIHKYDQSFNLT 
REILASG I ELNRVVNAWLNS PSHKEAL INTDTDKI GGYRLKTTDNIDI FWLFGKRKYKN 



f!86.nt 



ATGAAAAAATTGATTATAATTTTTACACTGTTTT^ 
CAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAA 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATA 
TTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCC^ 

GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAGCTCTT 

CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATAT^ 

TATAAGAATTGA 



t!86.nt 



ACAATGGATAAAATAGATACAAAAGAAGATATGAAAATTC^ 

ATCHAAACCATCTAGAAATAGATGATACCCTTGAAAAAG 

AACAATAACTCACACCCTTTTTGGCACAACCCC^ 

AGAGAAATACTGGCATCAGGAATTGAACTTAA^ 

CTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAA 

TCTTTTTGGAAAAAGAAAATATAAGAATTGA 



fl96.aa 



MKLKARMLLLVLILIAFFISILFFAFGmilNSKLVDQQFNLMINLIESIKSSFNLYIS 

FNEASKIKSKRLSFI SDQSEILIQTGSNMMVTDKEGKrVFTTAVKDNSDFGKS^ IVYNSFVM 

LADPGSIEESLLKDISKIKNKKGQIPYILIGMPLRDFETDNIFGYFMFLYSMDY 

TTGRLLVHHWLPGDILTDISASYSNIIKICrSEDLL 

RTSEDDFYYMSRATTI I IJ^SFVFTLLMLAIATLYLVKKLSSSLNKI LEYSERLASGNFTADINFGKWDTVELYSL 
YEGLEQLRTNFS SVAKGVT ENLDYL YENAI QI ANASQNLS SGAVEQASTLEQMTANI EQ I S QGVS ENTENAATTEK 
IAVNTNERTKEGHKSVVKAIEAMTVITEKIGI IDEITRQTNLLALNASIEAARVGEKGKGFEWAAEVRKLADQSK 
ESAREIIDIANRSLTVASRAGENFEQIVPGMEQTARLVKNISNESYKQSV^ 
SAMSEKMLESVKDLKESVDYFKIEK 
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tl96.aa 



MLINSKLVDQQFTOiMINLIESIKSSFNLYIS 

NMM\TTDKEGKI VFTTAVKDNSDFGKS IGDREYFTKLKESNSIVYNS FVMLADPGS I EESLLKDISK IKNKKGQI PY 
ILIGMPLRDFETDNIFGYFMFLYSMDYIYRSFRGINFGILSSGRAIA^ 

I KKTSEDLLQKNKE I STVYYYDPKSNKKYVG I SQKVLLNLSNNKF I LLMRT S EDDFYYMSRATT 1 1 LAI S FVFTLL 
MLAIATLYLVKKLSSSLNKILEYSERLASGNFTAM 
NAIQIANASQNLSSGAVEQASTLEQMTANIEQISQGV^ 
EKIGIIDEITRQTNLLALNASIEAARVGEKGKGFEWAAEVRKLADQSKE 

VPGMEQTARLVKNI SNE SYKQSVQ I EQFKNAI EQVSQLVQTTAS S S EEL SAMS EKMLESVKDLKESVDYFK I EK 



fl96.nt 

ATGAAGCTTAAAG^TAGGATGTTGCTACTTGTTC 

TTGGAATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTO 

TTCTTTTAATCTTTACATCTCTTCAATGGAAGAGAAAGT^ 

TTTAATGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATC 

GTAGTAATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGA 

CAAATCTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGI^ 

TTGGCAGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATA^^ 

CTTACATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACAT^ 

AATGGATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATAC 

ACTACGGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTGA 

ATATTATTAAGAAAACATCTGAAGATTTGTTGCAAAAGA^ 

AAGCAATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTC 

AGAACTTCAGAGGACGATTTTTATTACATGTCACGAGOTACAAC 

TACTTATGCTTGCTATTGCAACTCTTTATCTTGTGA 

TGAGAGACTTGCTTCTGGTAATTTTACTGCTGATAT^ 

TACGAAGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAG 

ATGAAAATGCAATTCAAATAGCAAATGCAAGCCAGAATTTA 

GCAAATGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTC 

ATTGCTGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTT 

TTACTGAAAAAATTGGAATTATTGATGAGATAACAAGGCAAA 

TGCACGAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTC 

GAATCAGCAAGAGAGATTATTGATATTGCAAACAGAAGTTT^ 

AAATAGTTCCTGGTATGGAACAAACAGCGAGACTTC 

AATAGAGCAATTTAAAAATGCAATAGAGCAGGTTAGTC 

TCTGCAATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAA 

AA 

tl96.nt 



ATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATC 

TTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTC 

TGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATC 

AATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTT 

CTATTGGGGATAGAGAATATTTTAC^AAACTTAAGGAGTCTA^ 

AGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAG^ 

ATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAAC 

ATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTC 

GGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTO 
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ATTAAGAAAACATCTGAAGATTTGTTGCAAAAG^ 
ATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAA 

TTCAGAGGACGATTTTTATTACATGTCACGAGCTACAACTATAATCTTAGCAACT 

ATGCTTGCTATTGCAACTCTTTATCTTGTGA 

GACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCA^ 

AGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTT 

AATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTO 

TGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTCTGAGAATA 

TGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTC 

GAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGC 

GAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAG^ 

AGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAA 

GTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAG 

AGCAATTTAAAAATGCAATAGAGCAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGA 
AATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTC 



f899.aa 



MRFIIAFI^ILNQGFSNLFSLPPEDIIFESSYEVAIKKAQKLNKNVLILVGRDIKENLIKDF 
RKSVFLVIDKDNEIFNKINLQKSPTIFFVDSK^^ 

NTADERTFFYKTLKGDWRLKFNGKDRKLVLFDTDLKEFLWKDINENKLYAIPKSRIGNIYFSLLGNEEWKLFGKI 
K 



t899.aa 
f899.nt 



ATGAGATTTATAATTGCATTTTTAATGATTTTAAATCA^ 

TTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTC 

TAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAAC 

AGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTT^ 

TTTTTTTTGTTGATTCTAAGAATGAGCA^ 

TAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTO 

AATACTGCTGATGAGAGAACCTTTTTTTACAA 

GAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGT^ 

TATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATO 

AAATAA 



t899.nt 



TTGCCTCCGGAAGATATTATTTTTGAGAGTTCTTATGAGGT^ 

TTTTAATTTTGGTTGGTAGAGATATTAAAGAAAATTO 

TATTCACAAAGTATCTAGAAAAAGTGTTTTTTTAGTTATTGAT 

CAAAAAAGTCCGACTATTTTTTTTGTTGA 

GCAGTGTTCAATTTGATAAGGATTTTTTAAACTATGTT 

AAAAGATTATGAAATTAATACTGCTGATGAGAGAACCT^ 

TTTAATGGTAAT^ACAGAAAGCTTGTTCTTTTT^ 

AAAACAAGCTTTATGCTATTCCTAAGTCTA^ 

GCTTTTTGGAAAAATAAAATAA 
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f924.aa 



MQDRKFSFRKYFLI SVFLI F IVSG I TYFYSTQMLEKSQKCVEDNLDAKVKLVDMEDH^FDLNECLNMDDFF I PRPD 
FLNENLNKNLVVIXSLIKNKFIiDENFFKDLWIKKENLFNTO 



t924.aa 



TQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECLNMDDFF I PRPDFLNENLNKNLVVDGLI KNKFLDENFFKDLW 
IKKENLFNVDIEKENEKLIDKILEISK 



f924.nt 



ATGCAAGATAGAAAGTTTAGTTTTAGAAAATATTTTTTAATTTCAGTAT^ 

CTTATTTCTATTCAACACAAATGTTGGAAAAATCTCAAAAG 

AGTTGATATGGAAGATTTTTATTTTGATTTAAATGAATGTCTAAATA 

TTTTTAAATGAAAATTTAAATAAGAATTTAG 

TCAAGGATCTTTGGATTAAAAAGGAAAATTTATTT^ 

GATTTTAGAAATTTCCAAATGA 



t924.nt 



ACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAG 

ATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATG 

TTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAA 

ATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATO 
CCAAATGA 

f925.aa 

MIRKYLIYISLLFIWEVYSKPAFISQDDSYELDFSSGEVDISVOT 

SYDNGAVFTFQTFKKEGKIKLVFTYQNVKDSSEFNKII I LKITKNFEVAI PQGVGGGSSRDNNI ETGNNLELGGGS 
I SGATSKE I IVRALNLS YINDYKGAIDLLlNKYNFNDDKYI LLKAE I HYKNGDYLKS YENYLKLKSKYFQS I VFDLI 
RLAIELNIKEEVLENARYLVEKNVDFSESIYLEIFE 
SESKHKDFLKALHYYKLVIDNYPFSYYYERAKIRYLFLKRFF 

t925.aa 

KPAFI SQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIY 

LVFTYQNVKDSSEFNKI 1 1 LKITKNFEVAI PQGVGGGSSRDNNI ETGNNLELGGGS I SGATSKE 
DYKGAIDLLNKYNFNDDKYILLKAEIHY^ 

EKNYDFSESIYLEIFEFLVTRGEHEFAliNFSSLYFPKYINSSFSDKYSYLLGKLYESESKHKDF^ 
NYPFSYYYERAKIRYLFLKRFF 

f925.nt 

ATGATTAGAAAATATTTGATTTATATAAGTTTGCTATTO 

GTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGAT^ 

TCTTTCTTTTAAAGATGAGTCTTGGATTTATATCA 
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TCTTATGATAACGGTGCTGTTTTTACTTTTC 

AAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTA 

ACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATC 

ATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTC 

TAGATTTGCTTAATAAGTATAATTTCAATGACGATAAA 

TGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGA^ 

AGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATOT 

TTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCOT 

CTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTC 

TCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATA 

ATTATTATGAGAGAGCCAAGATAAGATATTTATT^ 

t925.nt 

AAGCCAGCTTTTATAAGTCAAGACX5ATTCGTATGAGCTTGATTTO 

CCAATTCAAAATTTAATCTTTCTTTTAAAGATGAGTC 

TAAGTTAATTGGAGAATCTTATGATAACGGTGCTGTTTTTA 

TTGGTTTTCACTTATCAAAATGTTAAAGATTCAAGTGAATTO 

TTGAAGTTGCAATTCCACAAGGCGTTGGTGGTGGCTCTAGCA 

ACTTGGGGGGGGGAGTATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAG 

GATTACAAAGGAGCAATAGATTTGCTTAATAAGTATAATTTCAATGACGATA/^ 

TTCATTATAAAAATGGTGATTATTTAAAATCTTATGAAAATTATT^ 

TGTTTTTGATCTAATTAGGCTTGCTATAGAATTAAATATTAAAGAAGA^ 

GAAAAGAATGTTGATTTTTCTGAGAGCATTTATCT^ 

TTGCTTTAAATTTTAGCTCTCTTT 

GGGAAAACTTTATGAGTCTGAGAGCAAGCATAAAGATTT^ 
AATTACCCTTTTAGTTATTATTATGAGAGAGCCAAGATAAGATATTTAT^ 



f929.aa 



MTKVLWSAI ALLSKDKELI PFYKFLFLFFFFTLLACSKVSKDF IVFNKDVKTSSRI DNPNSNVLEVNKMEDFFGD 
I IDLKGYKI LSVQQENLNLDVYFEQVVIAQNFSNLNAYLFI IGFDP 

DYDFNIVIQGFLKDKSVLYVFQKSVLNDVS SYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPI SLEKYEKVGE 
DLIISKIEKYEYSWQGRYCLSSVSEKVGKID1WIYKTLKNI>SKDEVYKF3^G 
FERQSSEINLFRKNSQEVAKIEYISKPAYNTLNVSAKSLFSDLIVYNFWIKIVDKEN^ 
TFKRFDENVLNVKKGSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYGIFNIFPLK^ 
VESFFLEHSERIVQKQKFSTI ILNPIKI LKDDVSLVKGQKLKLERIEKI 



t929.aa 



KDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEV^ 
ENIiNLDfVYFEQVVLAQNFSNLNAYLFIIGFDPKIKAGTILFKT^ 

KSVLYVFQKSVLNDVS SYRPIFFDKVNGTVLINKYARSSAYEENRSRESYP I SLEKYEKVGEDLI I SKI EKYEYSN 

VQGRYCLSSVSEKVGKIDimiYKTLKNLSKDEVYKFLHGVWYDVHDYNKMm 

SQET7AKIEYISKPAYNTLNVSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTN 

GSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYG 

KQKFSTI ILNPIKI LKDDVSLVKGQKLKLERI EK I 



f929.nt 
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ATGACAAAGGTTTTGGTTGTTAGTGCGATTGCT 

TGTTTTTATTCTTTTTTTTTACA 

AAAGACTTCTTCCAGGATCGATAATCCAAATTCC 

ATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAGG 

TGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATA 

AACGATTCTTTTTAAAACTCAAATAGATATO 

GATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGATA 

TAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAGTO 

ATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCCTATO 

AAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAAAGAATTT^ 

GCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCATO 

TTTGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAATTCTC 

AACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTOT 

AATTGTAGATAAAGAAAACATTGAAATCAAAATTGAGACTAGC^ 

ACATTTAAGAGGTTTGATGAGAATGTCTT 

ATTACGTGTATAAGGATAAAATTTATGATTTTTCTTACCCCCATT^ 
TGGCATTTTTAATATTTTTCCTTT^ 

GTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAAAAGCAAA 
TTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAAAATTA 

t929.nt 



AAGGATAAAGAATTAATCCCATTTTATAAATTTTTGTTTOT 

TAAGCAAAGATTTTATTGTTTTTAACAAAGATGTAAAGACTTCTTC 

AGAAGTTAATAAAATGGAAGATTTTTTTGGAGATATTATAGATT^ 

GAAAATTTAAATTTAGATGTGTATTTTGAGCAGGTGGTTT^ 

TTATTATTGGTTTTGATCCTAAAATTAA 

TTCTTATAACATGTATCTTGAAGATATTACAGGTGATTATGATTTO 

AAATCTGTTTTGTATGTTTTTCAAAAATC 

TTAATGGAACTGTTCTTATTAATAAGTATGCAAGATC 

TATTT CT TTAGAAAAATATGAAAAAGTGGGGGA 

GTTCAGGGTAGATATTGTCTTTCTTCTGTGAGC 

AGAATTTAAGCAAAGATGAAGTTTATAAATT^ 

TGTCAAAGATATTGATGAAGTTTTATTCTTGTC^ 

TCTCAAGAAGTTGCAAAGATTGAATATATTTC^ 

TTTCAGATTTGATAGTTTATAACTTTTGGATCAAAA 

CACAAATTCTTATC^TAATAGTGGATTTTC 

GGGAGTAGTGATATTTATTTTATTCCTAGTGGAAA 

ATTTAACTTATATTGATGAGAATAAAATTTATTATGGCATTTTT 

TGAATATGAGATTGACATGGGTAGTTACAAGCTTGTTGAATC 

AAGCAAAAATTTTCTACAATC^TTTTAAATCCTATTAAAA 

AATTAAAGCTTGAGCGAATAGAAAAAATATGA 



f933.aa 



MNKLLIFVLATFCVFSSFAQANDSKNGAFGMSAGEKL^ 

GFDAVGIGLILAGAYLDIKALIX5ITKKAAFQWTWGKGVMIA 

GGFEPSFDVAMGQSSALGFELSFKKSY 



t933.aa 
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NDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFIjGFGIGSFAQGDI lggsli lgfdavgigl ilagayldikal 
IX3ITKKAAFQWTWGKGVMLAGVVTMAVTRLTE 1 1 LPFTFANSYNRKLKNSLNVALGGFEPSFDVAMGQSSALGFEL 
SFKKSY 



f933.nt 



ATGAATAAACTTTTAATTTTTGTTTTGGC 

ATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTT^ 

TTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTC 

GGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTT 

AAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTC 

AACAGAAATTATTCTTCGATTTACATTTGCT^^ 

GGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCC^ 

GCTATTAA 



t933.nt 



AATGATTCTAAAAATGGTGCGTTTGGGATGAGTGCTGGAGAAAA^ 

CTATTGTACCATTTTTATTGAACCTTTTTTTAGGGTT^ 

TTCTCTTATTCTTGGATTTGATGCGG 

GATGGTATTACTAAAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGT 
CTGTGAGAAGATTAACAGAAATTATTCTTC 

TAATGTAGCTTTAGGAGGATTTGAACCTAGTTTTGATGTTGCAATGGG 
TCTTTCAAAAAAAGCTATTAA 



£940. aa 



MRKYIFIILIAVLLIGVNIKKIAAAANIDRHTO^^ 
LAIGLEI^YMFNFDINHSFNILNPDSSVGKIF^ 

RTFDALPTISFGSGILVWFNYKWAFGATASWWMMFEFGNSAKMAHFA^ 



t940.aa 



ANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQra 

SSVGKIFYSVPITFSINYIFDIGEI^QIPVFTNIGFSIOT 

GATASWWMMFEFGNSAKMAHFALVSLSVTVNVNKL 



f940.nt 



ATGAGAAAGTATATTTTTATAATACTAATTGCAGTCTTGCTAATTGG 

CCAATATTGATAGGCATACAAACTC(^CTTTAG 

ATCAAAAGCTTATCCTACCAATTTATATCCA 

TTAGCAATTGGACTTGAACTAAGGTATATGTTO 

CAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTC 
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TCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACA 
AGAACTTTTGATGCACTCCCTACAA 

GAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTC 
ATCAGTTACAGTGAATGTAAATAAATTGTAG 



t940.nt 



GCCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCC 

TATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAAT 

TTTAGC^TTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAAC 

TCAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATO 

TTGAAATTCCAGTCTTCACAAATATAGGGTT^ 

AAGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAA 

GGAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGA^ 

TATCAGTTACAGTGAATGTAAATAAATTGTAG 



f943.aa 



MKNQFLNSYFQLITTIFLISSITIAAEEITSTLKVPNGFKra 
KNRKIYTIAKTIiQKPIGIDYWDNKLYISSVDKI 

KVDSKNNKLIVNIGSQHNVKIPPKKEAVILSINLKTKKEEIVAFGVRNSV 
PDEINVITEYKEHFGFPYWGKNQKNYGFTNKAPKNTKFIPSIYELPAHVAPI^IHF 

GSWNRSSPVGYKITTLDIDSKTRTARNYKTFLYGFLKHDKSKFGRPVDI ITYYDGSI LFTDDFGNKI YRVYYEKI 



t943.aa 



EITSTLKVPNGFKVEIFLOT^IEKPRGITSDQDGMFIGSGSTFAYFVTKNRKIY^ 

SSVDKIYWKNVKEEINKS IKSHKDYTWKMQI FALLPKNNSQMHSGRYI KVDSKNNKLIVNIGSQHNVKI PPKKEA 
VI L S INLKTKKEEI VAFGVRNSVGFDFHP I SNEI YFSDNGQDGLGDNI PPDE INVT TEYKEHFGFP YVFGKNQKNY 
GFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPK^ 
YKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



f943.nt 



ATGAAAAATCAATTTTTAAATAGCTATTTTCAATTAATTACAACTATO 

CAGAAGAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATT^ 

ACCTAGAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATC 

AAAAACAGAAAAATTTATACCATAGCAAAAACCCTGCAAAA^ 

ACATATCTTCTOTCGATAAAATATATGTAGTTAAAAATC 

AGACTATACATGGAAAATGCAAATTTTTGCACTTTTG^ 

AAAGTAGATTCTAAAAATAACAAATTAATAGTAAATATA^ 

AAGCAGTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTT^ 

GTTTGATTTTCACCCAATTAGCAATGAAATATATTTTAGCG 

CCAGATGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTA 

ATTACGGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATC 

TCCACTTGGAATACACTTTTACCGGGGAAATAAC 

GGCTCXSTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACi^ 
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GAAATTACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATC 
ATATTATGACGGTTCAATTCTTTTTACAGATGACTT^ 



t943.nt 



GAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGA^ 

GAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGG^ 

CAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTA 

TCTTCTGTCGATAAAATATATGTAGTTAAAAATGTA^^ 

ATACATGGAAAATCCAAATTTTTGCACTTTTGCCAAAAAAT^ 

AGATTCTAAAAATAACAAATTAATAGTAAATATAGGATC 

GTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGT^ 

ATTTTCACCCAAOTAGCAATGAAATATATTTTAGCGACAATGG^ 

TGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTC 

GGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTO 

TTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTA 

GTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTA 

TACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATCTA^ 

ATGACGGTTCAATTCTTTTTACAGATGACrTTGGAAATAA^ 



f952.aa 



MNYARFAVLIVLLFFYIWFFI ILRMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINI PFNNLFAKKDKLGD 
FESPIIVYGKSFNKSYEAKKVLKSMGFKNVFVAGTLKDMPQAKKEVG 



t952.aa 



RMKRTNLFIJuEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGDFESPIIWGKS 
SMGFKNVFVAGTLKDMPQAKKEVG 



f 952. nt 



ATGAATTATGCAAGATTTGCAGTATTAATAGTTCT 

AAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAAT^ 

TAGCAAGTCTCATTATTTGAAGTCAATTAACAT^ 

TTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAAT 

GATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATC 



t952.nt 



AGGATGAAAAGAACTAATCTGTTTTTGTTAGAAAAAA 
AAGAATATAGCAAGTCTCATTATTTGAAGTCAATTA^ 
AGGTGATTTTGAGTCCCCAATAATTGTTTATGG 
AGCATGGGATTTAAGAATGTGTTTGTTGCTGGAA 
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TABLE L Nucleotide and Amino Acid Sequences 
f378.aa 



MIKKFLLFAMLNI FI/TNKAHSNEEI IEI STEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYI 

DESLIEGVNYDIAYAQMLLETGALKFNGIVSKEQHNF^ 

SNMVDPRFYLVKRGSAPTIYDLTGKWAKDKLYDKKLK^ILLELLEYNNANKS 



t378.aa 



NEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELD^ 

GALKFNGIVSKEQHOTSGIGATNNLTKGNSFSNI 

LTGKWAKDKLYDKKLKKILLELLEYNNANKS 



f378.nt 



ATGATAAAAAAATTCTTGCTATTTGCAAT^ 

TCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTA 

CCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATC 

GACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATC 

TCAATGGAATAGTTTCAAAAGAACAACACAATTTT^ 

TTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTC 

TCAAATATGGTTGATCCTAGATTTTACCITGTTA 

GGGCAAAAGACAAACTTTACGACAAAAAACTTAAAAAAATATTATTAGAACT 
AAGCTAA 



t378.nt 

AATGAAGAGATAATCGAAATAAGTACTGAAATACAAAAGGAA7VAATATATTC 

CTCAACTAGAAGACCTTGTAAAATATACTCTAGAAATAAATCC^ 

TAAAACOTATATAGACGAATCTTTGATTGAAGGGGTTAATTATGACAT^ 

GGAGCTCTAAAATTCAATGGAATAGTTTCAAAAGAACJ^ 

CAAAAGGAAATTCTTTTTCCAATATTACAGAAGGAA 

ACAAAATATCAAATCAAATATGGTTGATCCTAGATTTTACCT 

TTGACTGGGAAATGGGCAAAAGACAAACTTTACGACAAAAAACTTAA 

ATAATGCAAATAAAAGCTAA 



f 4 .aa 



MKLFRRNVMIKMPSSFTI IFSLIVFVTILTYVI PAGKFDKEFKQMGDGSKREI IVAGTYQYVDRGSRGFLHPIMTI 
LTAMSKGMEHAVEVTVFVLIVGGAYGI IMKTGAIDVGIYFLIKKLGHKDKLLI PLLMFIFSIGGTVTGMSEETLPF 
YFVMI PLIVAIX3YDSLVGAAI I AIiGAGVGTMASTVNPFATGIASAIAS I SI^DGFYFRIVLYFVSVIjAAITYVCVY 

askikkdpskslwsqkdehyqyfvkkdglstgdnaqnaleftfahklvlllrc fsi vnlgwwmqemtm 

lylgvaiisaficki/getemwdafvkgs 

neiiqifigfwpsssghasltmpimapij^ 

kfvlplfmieffisilvhaniylsf 



t4.aa 
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KFDKEFKQMGDGSKREI IVAGTYQYVDRGSRGFLHPIMTILTAMSKGMEHAVEVI VFVLI VGGAYGI IMKTGAIDV 
GIYFLIKKLGHKDKLLIPLLMFIFSIGGTVTGMSEETLPFYFVMI PLIVALGYDSLVGAAI IALGAGVGTMASTVN 
PFATGIASAIASISLQDGFYFRIVLYFVSVLAAITWCW^ 

QNALEFTFAHKLVLLLFGFMIL I LIFSIVNLGVAfiMQEMTMLYLGVAI I SAF ICKLGETEMWDAFVKGSESLLTAAL 
\n:GLARG^IVCDIX3LITDTMU^™FLYNLPRPLFIILNEIIQI^ 

RASWIAMQTASGLINLITPTSGVIMAVIX5ISRLSYGTWFKFVLPLFMIEFFISILVIIANIYLSF 



f4.nt 



ATGAAATTATTTAGGAGAAACGTTATGATCAAAATGCCAAGTAGTTTTACAATAA 

TTACCATTTTAACGTATGTGATTCCTGCCGGTAAGTTTGATA 

GGAAATAATTGTTGCTGGAACTTATCAATATGTAGATCGAGGCTCTAGG 

TTAACCGCAATGTCAAAGGGGATGGAACATGCAGTTGAAGTT^ 

GGATTATTATGAAAACTGGAGCAATAGATGTGGGAATTTATTTT^ 

GCTTATTCCTTTGTTAATGTTTATTTTTTCAATTG^ 

TATTTTGTTATGATTCCCTTGATAGTAGCTTTGGGTTATGA 

CTGGAGTGGGAACTATGGCTTCTACTGTAAATCCATTTGCGACAGGAAT^ 

GCAGGATGGATTTTATTTTAGAATTGTTC^ 

GCGTCTAAAATTAAAAAGGATCCCTCAAAATCGCTTGTC 

AAAAAGATGGACTTTCTACCGGAGATAATGCTCAGAAT^ 

TTTATTTGGATTTATGATATTGATTTTGA 

TTGTATCTTGGAGTTGCTATTATATCGGCTTTTATTTGT 

AAGGTTCTGAAAGTCTGCTAACCGCTGCTCTTGTTATTGGACTTGCT 

GTTGATTACAGATACTATGTTAAATGCTGCTACTAATTTTTTATACAATCTTC 

AATGAAATTATTCAAATATTTATAGGATTTC 

TGGCTCCTCTTGCCGATTTTTTGT^ 

TAATTTGATAACACCTACCAGCGGAGTTATAATGGCTGTATO 

AAGTTTGTTTTACCATTATTTATGATTGAGTTT^ 

TTTAG 



t4.nt 



AAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAGGGAAATAA^^ 
• TAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATTTTAACC 
- AGTTGAAGTTATTGTTTTTGTTTTAAT 

GGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGA 

TTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTITC 

GGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAC^ 

CCATTTGCGACAGGAATTGC^TCTGCAATAGCTTCTATTA 

ATTTTGTATCAGTATTGGCTGCTATAACCTATC 

GCTTGTGTATTCTCAAAAAGATGAACATTATCAATA 

CAGAATGCTCTTGAGTTTACTTTTGCTCAT^ 

TTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGAC 

TATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGT^ 

GTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTG^ 

CTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCA 

TGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAATGGCTC 

AGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGG 

TGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTTAAGTTTC 

TTTTATCTCTATTTTAGTTATTATAGCT 
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f43.aa 



MKYFYFLFFLLIFNVYAQNVNSPALPSPPLLPEI^ 

IKALKKSSDSQYNFSLKKRLEKTFNAELKREILEL^^ 

DKEKLKKTLIDILENKEGNWSIAAYYLGELNSLEYSKNMMEW 

SLDNYTSGPSIKAAAIEALSYLASDKVT 

KAINAIKGHRDSSAKDILIYKLKSDPSLKVR^ 

ALSIALEIVNKENINRPSNVIJIGVASMI^ 

TNSEYI KMLLKDY 



t43 .aa 



LPSPPLLPEITENKPVERENSSKGENFSNVGIJXSKYV]^ 

NAELKREI LELFI SLKYSGG I DTANYI LENYESKRYSNALFGLAI S YLKEFDDKEKLKKTL IDI LENKEGNWS I A 

AYYLGELNSLEYSKNMMEWEKYSGN 

KVTENADLYLQSNNNNLNVKIJ^IAS^ 

DPSLKVREASAKALIDMDIX3NIEIKNIMFDFKI^ 

ASMIAGKXGNFDNFYSKIIDSKNIDLJIHLALKGAVYNKSSSLSDKLKJ^ 



f43.nt 



ATGAAATACTTTTATTTTTTATTTTTTTO 

CTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTC 

TTCTAATGTTGGTTTAGATGGTAAGTATXOT 

ATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAAT^ 

CTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTC 

TCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCT^ 

GATAAAGAAAAATTAAAAAAAACTCTTATTGACATTC 

ATTATTTAGGAGAGCTTAATTCTCTTGAGTATTC 

TGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTC 

TCGCTAGATAATTACGAGGGCCCATCAATTAAGG 

TAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAA 

GTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTOT 

AAAGCTATTAATGCAATGAAAGGACATAGGGAC^ 

CATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGC^ 

TATGTTTGATTTTAAGATTGACAATAATTTTAAAAT^ 

GCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATACT 

CAATCTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTAT^ 

GCATTTAGCATTAAAAGGAGCTGTTTATAATAAA 

ACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



t43.nt 



CTTCCTAGTCCGCCOTTGTTGCCCGAAATTACAGAAAATAAGCCTCTTGA 

ATTTTTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATAC 

CATTATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTC 

AATGCTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTC 

ATATTCTTGAAAATTATGAGAGTAAAAGATATTC 
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TGATGATAAAGAAAAATTAAAAAAAACTCTTAT^ 

GCTTATTATTTAGGAGAGCTTAATTCTCTTGAGTATTCTAAAAACA 

ATGATGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGC 

AATTTCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATC 

AAAGTAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATC 
CTTTGTCCAAAGATCCTTCTTTAAAGTC 

GTTTAAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGG 

GATCCATCTCTTAAAGTTAGGGAGGCTTCTGCT 

ACATTATGTTTGATTTTAAGATTGACAAT^ 

AAAAGCATTGTCAATTGCTTTAGAAATTGTT^ 

GCTTCAATGTTGGCTGGTAAAAAGGGTAATTTTGATAAT^ 

TAAGGGATTTAGCATTAAAAGGAGCTGTTT^ 

TGAAACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



fSO.aa 



MKFVLNNLFKGCLICFFLFFSCLTTDRSIQDSHISDIVEKK^^ 
LSDAFLKENNFYFKKARESYAKKNIGLTNYYLNKIVT^ 

DYKYSHASLRLAELKYLVKEKSDAISAFKEINEFSISGYDREIYGFLSNKLGV 
NIFVTNILGGLLRYNIKKNIX^RVYLKDKKSIFI^GIRGFAD^ 

NVQVLLAVKNGIFVGTLNSGLWFYDLKNWKNI PLGSNKI SSLCFDSLKNLLLVGTVDKAI YSVNVDNLKKI EHLDF 
FSKNDNEKNINFIKEYKDSYFVGTYGGGLFELNLNKNSYKKH^ 
SENDNWDYFGPNNGLLNLNLIKVSRFENYVI LGTINNGLVFVDENIKKQL 



t50.aa 



CLTTDRSIQDSHI SDI VEKKKEAVI IDDNNVVLGSNBGKFKRDYLIGLKDNESFFLSDAFI*KENNFYFKKARESYA 
KKNIGLTNYYIoNKIVTNENQHSRELLAKANLFFGYVN^ 

SDAI SAFKEINEFS I SGYDREI YGFLSNKLGVSHLNLESLGFLD^ KKNDC 
RVYT*KDKKSIFI^GIRGFADYNGTIYIGGKNVVYYIDDVTC^ 

WFYDLKNWKNI PLGSNKI S SLCFDSLKNLLLVGTVDKAI YSVNVDNLKKI EHLDFFSKNDNEKNINFIKEYKDSYF 

VGTYGGGLFELNLNKNSYKKHVXANNIDVNYFMDM^ 

KVSRFENYVI LGTINNGLVFVDENI KKQL 



f50.nt 



ATGA7VATTTGTTTTGAATAATTTATTTAAAGGTTG 

ATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAG 

TGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAGAGACTAT^ 

CTTAGTGATGCTTTTTTAAAAGAAAATAATTTOT 

GCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATC^ 

TTTGTTTTTTGGATATGTAAATTATGAGAAT^ 

GACTATAAATATTCTCATGCTAGTTTAAGATO 

CTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGG 

TGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAA^ 

AATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAA 

AGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGG^ 

AAATGTTGTOTATTATATAGATGATGTTGATGGGGATTTAAA 

AATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTC 

TAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTC 
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ATTAGTTGGAACAGTTGACAAGGCTATTTATAGTC 
TTTAGCAAAAATGATAATGAAAAAAATATTAATTTTAT 

GTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGOT 

TTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTC 

TCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTC 

TTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTG 

A 



tSO.nt 



TGCCTTACTACAGATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGA^ 

TTGATGATAATAATGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAG 

TGAATCTTTTTTTCTTAGTGATGCT^ 

AAAAAAAATATTGGCTTGACAAATTATTATTTGAATAAAATAGTAACTAATC 
TAGCTAAAGCGAATTTGTTTTTTGGATATGTAAATTATGAGAATGG 
TCTATTTTTAAAAGACTATAAATATTCTCATGCTAGTTTAAGATTAGCTO 
TCTGATGCAATTTCTGCATTTAAAGAGATT 

TAAGTAATAAACTTGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTC 

TGTCTTTAATGACAATATATTTGTAACTAATATATTGGGAGGGCTTT^ 

AGAGTCTATCTTAAGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGG 

ATATTGGTGGTAAAAATGTTGTTTATTATATAGATGATGTTGATGG 

TGCTGATTTTAGCAATGTACAAGTTTTGCTTGCTGTTAAAA^ 

TGGTTTTATGATTTAAAAAATTGGAAAAATATACCGCTT^ 

TAAAAAATTTATTATTAGTTGGAACAGTTGAGAAGGCTATTTA 

AGATTTGGATTTTTTTAGCAAAAATGATAATGJ^ 

GTTGGAACATATGGTGGGGGTCTTTTTG^^ 

ATATTGATGTTAATTATTTTATGGATATGGAG 

ATTGATTTATGATTCTGAAAATGACAACTGGGATTATTT^ 

AAAGTTTCTAGATITGAAAATTATGTCATACTGGGCACTATTAATAACG^ 

AAAAACAGTTATGA 



f65.aa 

MHI FKWPFQINLILFLLVSVAKINASSKFYYAEQWYVIFNSQMKKKPENYKKNI FFLQKALKYPFGNPKYSLTKI 

ETKEQWEKYKLLFKMHVNLI^WQNI^^ 

ENYTTVKLENDGITNWEDEYHKISLKELNYYDIIKKE 

t65.aa 

KINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSL^ 
RQNLHLGDLFOTRNLYFFKTPEKIXSIISNLEKSKKLYKIAIN^ 
ISLKELNYYDI IKKELLRIDETKAFFEQGPNYY 

f 65. nt 

ATGCATATTTTCAAAAATGTCCCCTTCCAAATAAAT 

CATCGTCCAAATTTTATTACGCAGAACAATGGTATGTAATTO 

TAAAAAAAATATATTTTTTCTTCAAAAAGC^ 

GAAACCAAAGAACAGTGGGAAAAATATAAACTTCTTTT^ 

TACATTTAGGAGATTTATTCGAGACAAGAAATTT^ 

TCTAGAAAAATC^AAAAAATTATATAAACTAGCTATTAATTACTACAGCGAA 
GAAAATTACACAACTGTTAAACTAGAAAACGATGGAATAACAAACTGGGAAGATGAATATCATA 
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AAGAGCTTAATTACTATGACATTATTAAAAAAGAACTACT 
GCCAAACTATTATTAA 

t65.nt 

KINAS SRFYYAEQWYVT FNSQMKKKPENYKKNI FFLQKALKYPFGNPKYS LTKI ETKEQWEKYKLLFKMHVNLLLV 
RQNLHLGDLFDTRNLYFFKTPEKDGI I SNLEKSKKLYKIAINYYSEALKYHKKLENYTTVKLENIX^ITNV^DEYHK 
ISLKELNYYDI IKKELLRIDETKAFFEQGPNYY 



f 8 .aa 



MKNINRLILLILTTHTLLFSCAL^ 

LIKKTAAKIKISPQKLEEIKNYLNAYKNYLNNETEW 

IEI£MHLEKEILNLIEQTFHDKNLGY 

MHFLKVENSKIKTI I EKQKISDLHNELYYSKQSPPRRRKRSTADSDNNNKYDI I PKI IDPNTGIEITPKNLRSILS 

NGDI ILIKPKIDWTEFFYFWQHVGIFDEEKYEATKKI AFNGIDSFDIKS I ITSNQIKFDTASTQGS6YEKLSTYVQ 

SRILKIFSPITDIRTIQKAINFGRSRYIDNNFGYMVPLISSNLOT 

NYEISGIITPGEINAAAYNFYMSYTIAGILPSVLPK^ 

LWCSGS 



t8.aa 



caliadnksknlstseiiltqktllessliknpsnveyripissiqeilnnnndsflikktaa^ 
nyl^yknylnnetewikfidqsswgnltiki 

dknlgyiqlshinsffpqeninsitkei idgkeyiaphi ianqllkikdkkyfeqfmhflkvenskiktiiekqki 
sdlhnelyyskqspprrrkrstadsdnnnkydiipkiidpntgieitpknlrsilsngdiilikpkidwteffy™ 
qhvgifdeekyeatkkiafngidsfdiksiitsnqik^ 

NFGRSRYIDNNFGYMVPLISSNLWTDSFNLEEIH^ 

YMSYTIAGILPSVLPKRLIKPTLKEKFIGYNKEIVDAIELKKSKEKIFGRACNITNLWCSGS 



f8.nt 



ATGAAGAATATTAATAGATTAATATTATTAATATO 

ATAATAAGTCAAAAAATTTAAG(^CATCAGAAATCATATTAACACAAAAAACACTA 

aaatccttctaatgtagaatatcgaataccaatatccagt^^ 

ttaataaaaaaaagagcagcaaaaatcaaaataagccct 

ataaaaattatctaaataatgaaacagaatggataaagtttatagatcaaagtag^ 

taaaattgatactgcttttgaaaaaaaaacaaattttaatc 

atagaactacaaatgcatctggaaaaagaaattttaaacttaa 

atatacaattaagtcacatcaactcattctttcctcaagaaaatataaactc 

aaaagaatatattgcaccgcacataatagcaaatcaattattaat^aataaaagataaaaaatat^ 

atgcactttttaaaagttgaaaacagcaaaataaaaacaat^ 

aactgtattattgaaaacaatccccgcccagaagaagaaaaaggt^ 

cgatataataccaaat^taatagacccaaatacaggcattgaaataactcctaaaaatttaagatc 

aatggcgacataatactaataaaaccaaaaatagattggac^gaa 

ttgatgaagaaaaatatgaagccaotaaaaaaattgcattcaat^ 

TACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAA 
TC7UVGAATATTAAAAATATTCTCACCAATAACAGACAT 
GATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATT^ 
AGAAATTCAC7VACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAA 
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AATTACGAAATTTCGGGAATAATTACTCCTGGAGAAAT^ 
TTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTA^ 
TAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAAT^^ 
CTCTGGTGCTCAGGAAGTTAA 



t8.nt 

TGTGCCTTAATTGCAGATAATAAGTCAAAAAATTTAAGCA 

AAAGCTCTTTAATAAAAAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAA 
TAACAATGATTCTTTTTTAATAAAAAAAACAGCAGCAAAAATC 

AACTATCTAAATGCTTATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTO 

ATGGAAATTTAACAATTAAAATTGATACTGCTTTTGAAAA 

AAATTTAACAGAACTAATAGAACTACAAATGCATCTGGAAAAAGAAAT^ 

GATAAAAATTTAGGATATATACAATTAAGTCACATCAACTCATTCTTTC 

AAGAAATAATAGATGGAAAAGAATATATTGCACCGCACM 

ATATTTTGAACAATTTATGCACTTTTTAAAAGTTGAAAA 

TCAGATCTTCAGAATGAACTGTATTATTCAAAACAATCC 

ATAATAACAATAAATACGATATAATACGAAAAATAATAG 

AAGATCTATTTTATCAAATGGCGACATAATACTA 

CAACATGTGGGAATATTTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGC^ 

ATATAAAATCAATAATTACAAGCAATCAAATCAAATTCGATACA 

TTCAACATACGTACAATCAAGAATATTAAAAATATTCTCACC 

AATTTTGGAAGAAGTAGATACATTGACAATAACTTTGGATATATC 

ATTGATTCAATCTTGAAGAAATTCACAACAA 

ACTTAATGTATCAAGAAATTACX3AAATTTCGGGAATAATTACTC 

TACATGTCTTATACGATTGC AGGAATACTTCC AAGCGTGC TTCC AAAAAGGCTCATT AAAC C AACATTAAAAGAAA 

AATTCATTGGTTACAATAAAGAAATAGTAGATGCM^ 

TTGCAACATTACAAATCTCTGGTGCTCAGGAAGTTAA 



£82. aa 

MTRWSKFFLPFCFSMLLFANSEDSNEKDI VSKDENPWENEVIXSYWVGYNDVSNIKN^ 1 I YT YKYNGEVYGRI LT 

IIKDGKKYDAKNPSGDTVVGFENLAIEGLDFMWGLKYSSSSKK^ 

VWIFGRSKIWTRAKDDEI PKLDLHNLVPAPPVKK 



t82.aa 

EDSNEKDIVSKDENPWENEVLGYWVGYNDVSNIKNSI IYI YKYNGEVYGRI LTI IKDGKKYDAKNPSGDTWGFE 
NLAIEGLDFMWGLKYSS SSKKWDRGKI IDPKNGKI YNSEMRVDSKTGNLITKGIOAtf IFGRSKIWTRAKDDEI PKLD 
LHNLVPAP PVKK 



f82nt 



ATGACTAGAGTTTTTTCAAAGTTTTTTCTTTTT^^ 

ATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGA^ 

TGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAG 

ATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTC 

TAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTC 
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TCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTC 

GTTTGGATTTTTGGTAGAAGTAAAATTTGGA 

TTGTTCCAGCGCCCCCTGTGAAAAAATAA 

f82.nt 

GAAGATTCAAATGAAAAGGACATTGTTAGCAAGGATGAAAAC^ 

TTGGTTATAATGATGTAAGTAACATAAAGAATTCTATTATCTATATTTAT 

AATTTTAACTATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCA 

AATCTTGCAATAGAGGGTCTTGATTTTATGTGGGGTCTTA^ 

AAATAATAGATCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTO 

CAAGGGGAAAGTTTGGATTTTTGGTAGAAGTAAAAT 

TTGCATAATCTTGTTCCAGCGCCCCCTGTGAAAAAATAA 



£86. aa 



LIIKILEENNIINEIGSNNIESKNIALVNVSI^^ 

IASGITQPNNKLKGSGYTIDSVI INENQNINHSYNI ILKKGNYTLINRIHKILTSKKIIJNKIKSDSTIEIEAKNIS 

LLEEIENIKIETNPKILIDKKNGIILASENAKIGT^ 

NKELIQI IQAAQKINKLNGELI LEEIDGNQN 



t86.aa 



LKTDQSFNNSLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDLIIKILEE^ 

ALVNVSLQVKGNTIKGSKHKACVAS ILDSKDLTNGI LLKTNLKNKEGEI I AI ASGITQPNNKLKGSGYTIDSVI IN 

ENQNnraSYNI ILKKGNYTLI^ 

IJ^ENAKIGTFTFSIEKDNQNIFLSKNNKTT^ 

IDGNQN 

f86.nt 



ATGAACAAACTAATGTTGATGTTAATTACATTTGCAACGAGTC 

TAAAAACAGATCAATCATTTAACAATAGCCTATC^ 

AAATAGAAATTTTTTAACAGGTATTGGAATAGT^ 

CTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTC 

CACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATAC 

ACTGGACTCAAAAGATTTAACAAATGGAATACTT^ 

ATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATC 

AGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATAC^ 

AATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAATAG 

CTATTAGAAGAGATTGAAAATATTAAAATAGAAACCAACCCCAAGATATT 

TAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCC 

AAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATT^ 

AATAAAGAATTAATTCAAATAATTCAAGCTGC 

TTGATGGAAACCAAAATTAA 



t86.nt 



WO 98/59071 



PCT/US98/12718 



84 

TABLE 1 . Nucleotide and Amino Acid Sequences 

CTAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATT 

CAAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGA 

CCTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTC 

GCACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAA 

TACTGGACTCAAAAGATTTAACAAATGGAATACTTTTA 

AATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATCTGGATATACTATAGATAGTC 

GAGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATACATTA 

AAATATTAACCTCTAAAAAAATCAACAACAAAATTAAATC 

CCTATTAGAAGAGATTGAAAATATTAAAATAGAAACCAA 

TTAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCA 

AAAAT7UVCAAAACAACAATTCAAGTAAACTCAATGAAATTAAA 

CAATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAATTAA^ 

ATTGATGGAAACCAAAATTAA 



f90.aa 



MCPITFTIPFFLAIFFAFSSSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIWLSKKYLSISLSFSKMIFI 
KSLFEVIKLPIWLFIIFASGYFLNAFSIFLCISSFLSFMFI 



t90.aa 



SSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAJVTL^ 
GYFLNAFSIFLCISSFLSFMFI 

f90.nt 



ATGTGTCCTATTACTTTTACCATTCCATTTTTTCTAG 

CTTCTGTGTCTTTGCTATCAAGAAATACG 

TACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTC 

AAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTAT^ 

TTTCGATTTTTTTGTGTATTTCTTC 



t90.nt 



AGCTCCTTTGTTACGGACTCTTCTGTGTCTTTGC 
CTTTGCCTATTATTTCTGGTACGCTTCCTGC 
TTCTAAAATGATTTTCATCAAATCTTTATTTG^ 
GGATACTTTTTAAATGCT1TTTCGATTTTTTTC 



f469.aa 

MANVALSSGFISQKIFGI 1 1 IMVFLPTI IATPI INFLFKINKSGLKKELPIDQNTHICVSFEYDNLAKILIWDFKN 
ELRKEGFFTQQIKNDSSQYINARKNNISFSIKREXSSKI^ 

KLDYS INYDKILSNINLNKRIKKENI I LELKS SNKADVIRELLSVINI EIDKERIFQDLMEREKLITTALKEGFAI 

PHLKTNLISKIHIAIGISHEGIDFNALDK^ 

NIIVSZ 



t469.aa 



WO 98/59071 



PCT/US98/12718 



85 

TABLE 1. Nucleotide and Amino Acid Sequences 

VFLPTIIATPIINFLFKINKSGLKKELPIIXJNra 
RKMNISFSIKREGSKITFECPNNHLIIIQDLFRETILNLEK^ 

KENIILELKSSNKADVIRELLSVINIEIDKERIFQDLMEREKLITTALKBGFAIPHLKTNLISKIHIAI 
DFNALDKNLSHVF ILILC PAKDYVS YPRI LASWGKVDLYKKEI LNAKTDKEI YNI IVS Z 

f469.nt 

ATGGCAAATGTAGCATTATCTTCAGGAT 

CAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATA 

AGATCAAAATACACACATATGCGTATCAT^ 

GAGTTAAGAAAAGAAGGATTTTTTACAC^CAAATTAAA 

ATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATG^ 

AGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAA^ 

AAACTAGATTACTCAATAAATTACGATAAAATCCTT^ 

TTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCT 

TAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACT^ 

CCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAA 

CTCTTGAGAAGAACTTAAGTCATGTTTTTATATTAATACTC 

TTTAGCATCTGTTGTGGGCAAAGTTGATCTGT^^ 

AATATAATAGTGAGCTAA 

t469.nt 

TTTTTGCCAACAATCATTGCAACACCCATAATAAACTTTTTATOT 
TCCCAATAGATCAAAATACAGACATATGCGTATC 
TAAAAATGAGTTAAGAAAAGAAGGATTTTTTACACAACAAATT 
AAAAACAATATATCCTTCTCAATAAAACGAGAAGGT^ 

TAATACAAGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGA^ 

AGCAAAAAAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAA 

GAAAACATTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGA 

AAATTGATAAAGAAAGAATATTCCAAGATTTAATGGAAAGA 

TGCCATTCCCCATTTAAAAACAAATTTAATATCAAA 

TTTAATGCTCTTGACAAGAACTTAAGTCATGTTTTT 

CTAGAATTTTAGCATCTGTTGTGGGCAAAGTTGATCTG 

AATTTATAATATAATAGTGAGCTAA 

f477.aa 

MEKPQGVSIVGAISGAMHVHLMAEHYGVPWL^ 

KENIEISKKFLERMAKIEMFLEIELGITGGEEDGVDNSDRAL^ 

GVYKPGNVKLTPKVLKDGQDWISKTG^ 

YYKKNESRLQGQLGDGKDIDI PNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRNZ 
t477.aa 

MHVHLMAEHYWPVVLHTDHCAKNLLPWVEGL^ 
IEMFLEIELGITGGEEIX3VDNSDRALHELFSTPEDI1^ 
DGQDYVISKTGVNMAKPVSYWHGGSGSTIDEINEALSYGVV^ 
KDIDI PNKKFYDPRVWLREAEVSMKDRWI ACKNLNNINRNZ 

f477.nt 

ATGGAAAAACCACAAGGAGTTTCAATAGTTGGAGCTATTTC 
ATGGTGTTCCTGTTGTTCTTCATACTGATC 

TGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACATATGTT 
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AAAGAAAATATTGAAATTTCTAAAAA 

GAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGA 
TATTTATTATGGATATTCAGAACTTTT^ 

GGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAAGATGGTC^ 

CAGGAGTAAATATGGCTAAGCCAGTTTCTTATGT^ 

GGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACA 

TATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGAT^ 

TTTATGATCCAAGGGTTTGGTTAAGAGAAGCTGAAGTTTC 

TAATAATATTAATAGAAATTAA 

t477.nt 

ATGCATGTTCATTTAATGGCAGAGCATTATGGTGTTCCTGTTGTTC 

TTCCTTGGGTTGAAGGCCTTTTAGAATATGGAGAGAAATACTATAGTC 

TATGTTAGATTTATCAGAAGAACCTATTAAAGAAAATATTGAAATTTCTAA 

ATTGAAATGTTTTTGGAAATAGAGCTTGGAATTACGGGTGGGGA 

TGCATGAACTATTTTCTACTCCTGAGGATATTTATTATGGATA 

GATTGCAGCAGCTTTTGGAAATGTTCATGGGGTATATAAACCO 

GATGGTCAAGATTATGTCATATCAAAAACAGGAGTAAATATGGCTA^ 

CTGGATCTACAATTGATGAGATTAATGAGGCGCTTTC^ 

GTGGGCTGCCTGGGAGGGTGTTTTAAATTATTACAAAAAAAATGA 

AAGGATATTGATATTCCAAATAAGAAATTTTATGATCCAAGG^ 

ACCGTGTGAAGATTGCATGCAAAAATCTTAATAATATTAATAGAAATTAA 

f488.aa 

MPSSFPFLLVNGSSGIAVGMATNMAPHNLREICDAIVYMLDNENAS IFDLLKIVKGPDFPTFGE IVYNDNLI KAYK 
TGKGSWTRARYHI EERAEDRNAI IVTEI PYTVNKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGF 
DPHVIMNLLYEYTEFKKHFS INNLALWGI PKQLNLEELLFEF I EHRKNI I ERR I EFDLRKAKEKAHVLEGLNI AL 
NNIDEVIKIIKSSKLAKDARERLVSNFGLSEIQANSVI^^ 

I INI IREETINLGLKFGDERRTKI I YDEEVLKTSMSDI^QKENIVVMLTKKGFLKRLSQNEYKLQGTGGKGLSSFD 
LNDGDEIVIALCVNTHDYLFMI SNEGKLYLINAYEIKDSSRASKGQNI selinlgdqeeiltiknskdltddayll 
LTTASGKIARFESTDFKAVKSRGVIVTKIiNDKDFVTSAEIWKDEKVICLSKKGS 

MKLKEGDLFVKVLSVKENP YLL IVSENGY GKRLNMSKI SELKRGATG YTSYKKSDKKAGSVVDAI AVSEDDE I LLV 
SKRSKALRTVAGKVSEQGKDARGIQVLFLDNDSLVSVSKFIKZ 

t488.aa 

MATNMAPHNLREICDAI VYMLDNENAS IFDLLKIVKGPDFPTFGEIVYNDNLIKAYKTGKGSWIRARYHI EERAE 
DRNAI IVTEI PYTVNKSALLMKVALLAKEEKLEGLI^ 

SINNLALVNGI PKQLNLEELLFEFIEHRKNI IERRIEFDLRKAKEKAHVLEGLNIALNNIDEVIKI IKSSKLAKDA 
RERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSL INI IREETINLGLKFGDE 

RRTKI I YDEEVLKT SMSDLMQKENI VVMLTKKGFLKRLSQNEYKLQGTGGKGLS S FDLNDGDE IVT ALCVNTHD YL 
FMI SNEGKLYLINAYEIKDSSRASKGQNI SELINLGDQEEILTIKNSKDLTDDAYLLLTTASGKI ARFESTDFKAV 
KSRGVIVI KLNDKDFVTSAEI WKDEKVICLSKKGSAFI FNSRDVRLTNRGTQGVCGMKLKEGD 
YLLT7SENGYGKRLNMSKI SELKRGATGYTSYKKSDKKAGSWDAI AVSEDDEI LLVSKRSKALRTVAGKVSEQGK 
DARGIQVLFLDNDSLVSVSKFIKZ 

f488.nt 

ATGCCX3TCATCATTTCCATTTCTTTTGGTAAATGGCTC 

CTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACAT^ 

TAAAATAGTTAAAGGGCCTGATTTCCCAACTTTTGGAGAGATTC 

ACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGAT^ 

TTACAGAAATACCTTATACGGTAAATAAATCTGCAC 

AGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTC 
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GATCCTCATGTTATTATGAATTTGCTTTAT^ 
TTGTTAATGGTATTCCCAAACAGTTAAATTTAGAA 

CGAAAGACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATC 

AATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAA 

ATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTG 

TAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATG 

ATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTO 

ATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATA 

AGGTTTCCTTAAAAGACTTTCACAAAATGAGTATA 

CTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTC 

GAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATT^ 

TATTAATTTAGGAGATGAAGAAGAAATATTAAOT 

CTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCT^ 

TTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGA 

TAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATC 

ATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATC 

AAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGAG^ 
TAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGC^ 
AGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAA 
TATTATTTCTTGATAATGACAGCTTGGTTTCTG^^ 

t488.nt 

ATGGCTACTAATATGGCACCTCATAATTTAAGAGAAATTTGTGATGCCA 

CTTCTATATTTGATTTGCTTAAAATAGTTAAAGGGCCTC 

TTTAATTAAAGCATACAAAACTGGCAAGGGAAGTGTTGTTATTAGGGCAA^ 

GATAGAAATGCTATAATTGTTACAGAAATACCTTATACGGTAAATAAATCTGCACTTC 

TAGCAAAAGAAGAAAAGCTAGAAGGACTTTTAGATATAAGAGATGA 

TGAAGTTAAAAGAGGATTTGATCCTCATGTTATTATGAATTT^ 

AGTATAAATAATTTAGCCCTTGTTAATGGTATTCCCAAACAGTTAAATTT^ 

AGCATAGAAAAAATATTATCGAAAGACGTATTGAATTTGACTTGAGAAA 

GGGATTAAATATTGCTTTAAATAATATAGATGAGGTTATTAAGATTATTAAATC 

AGGGAGAGGCTTGTTTCGAATTTTGGTCTTTCAG^ 

TTACAGCCCTTGAGATTTTTAAGCTTGAAGAGGAGCTT 

TCTCTTGAATCCAGTAAGGATTATTAATAT^ 

CGTCGAACTAAAATAATTTATGATGAGGAGGTTOT 

TTGTTATGCTTACAAAGAAAGGTTTCCTTA^ 

AGGACTAAGTTCGTTTGATCTAAATGATGGAGATGAGATTC 

TTTATGATTTCAAATGAAGGAAAGCTTTATTTAATCAATGC 

GTCAGAATATTAGTGAGCTTATTAATTTAGGAGATC 

TGATGATGCTTATTTATTGCTTACAACTGCAAGT^ 

AAGTCACGAGGTGTTATTGTTATTAAACTGAATC^ 

aaaaagtaatttgtctttctaaaaagggtagt^ 

tacccaaggtgtttgtggaatgaaattaaaagaaggtgatttgtttgot 

tatcttttgattgtttctcaaaatggot 

ccactggttatactagttataaaaaatctgataaaaaagcgggtagtgttgt^ 

tgatgaaatcttgcttgtaagtaaacgttcaaaagctttaaga^ 

gatgctagaggaattcaagtattatttcttga™ 

f494.aa 

MFALIRKIFMI YFLC ITLAGFAMI F IDSKFTEQPNVKENQSKINQHTI EPNLIMFTSS IGGFLGVYVGIWI FNYDK 
SNFYLNWGNLIILIYNIALIITVYSKSHS 



t494.aa 
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MIFIDSKFTEQP3WKENQSKINQHTIEPNLIMFTSSIGGFLGVW 
VYSKSHS 

f494.nt 

ATGTTTGCATTAATTAGAAAAATATTTATGATCTATTTTTTAT 

TTGACAG^AAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAA 

TTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATC 

AGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATAC^ 

AATCACATAGTTAG 

t494.nt 

ATGATTTTTATTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAA^ 
TTGAACCCAATTTAATCATGTTTACATCTTC 

CTATGACAAAAGCAATTTTTACCTAAATTGGGGAAATTTAATAATATT^ 
GTATACTCAAAATCACATAGTTAG 

f516.aa 

MKKTPNTCIFLTLLIISNLNAI^ 

TFSNNTFDFLIVAMILTRESI^IPKKMQYFKSYIGGGINWHIANLIKKTKYFS 
KLPYVIEPYMFIEISSKKAI PLMGLDFKIDFLFLDTFNISFNFTIRYNFKDKNEMET 

t516.aa 

NEEX3NTNEKNDQPKQISNFFSPERGFIYSTGIGI 

PKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYEKLPYV^ 
GLDFKIDFLFLDTFNI SFNFTIRYNFKDKNEMET 

f516.nt 

ATGAAAAAAACTCCAAACACTTGTATTTTCTTAACATTGCTTATC 

AAGGCAATACTAATGAAAAAAATGATGAACCCAAAC 

TTCAACAGGAATTGGGATTGGAGTTGGAT^ 

ACATTCTCTAATAATACTTTTGATTTTOT 

AAATGCAATACTTTAAATCTTATATTGGAGX^ 

TTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACC 

AAATTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTO 

ACTTTAAAATTGATTTTTTATTTTTAGATAC 

CAAAAACGAGATGGAAACATGA 

t516.nt 

AATGAAGAAGGCAATACTAATGAAAAAAATGATCAACCCAAACAAATCTC 

TCATATATTCAACAGGAATTGGGATTGGAGTTGGAT^ 

TTATTATACATTCTCTAATAATACTTTTGATTTTTTAATC 

CCCAAAAAAATGCAATACTTTAAATCTTATATTGGAGG^GG 

CAAAATATTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTO 

TTACGAAAAATTGCCTTATGTAATAGAGCCTTATATGTTTATTG^ 

GGGTTAGACTTTAAAATTGATTTTTTATTTTTA 

TTAAGGACAAAAACGAGATGGAAACATGA 



f517.aa 
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MIPWASCX3ILIALSIAFVGIGPDGPNFAEHP 

GNVKAGFLGAI FAGFIiAGYVARFLARRSVPEWLRPVMPI FVI PLI STI IVGFFMLYFGVY IGKFMGVLESGLKSLQ 
SNSETFGVLGKIFLGLVIjGSMI TVDMGGPFNKVAFIiFGVGLI PQVPEIMGMVAAAI FVPPMAMGLATFLAPKIjFEN 
EEKESGKI AFLI S FIGI SEGAI PFAASDPGRVI PS IWGGAVS S 1 I AAFI/3VANHAPHGGP IVLFVIDNKFGFI IA 
IAVGVAVATALVIFLKSLKLKESE 

t517.aa 

dkpgltpglvggvmsgnvkagflgai fagflagyvarflarrsvpewlrpvmp i fvi pli sti i vgffmlyfgvyi 
gkfmgvlesglksi^snsetfgvix;kiflglvix3Smitvdmggpfnkva^ 
amgiatflapklfeneekesgkiaflisfigisegaipfaasdpgrv^ 
ivlpvidnkfgfi i ai avgvavatalvi flkslklkes e 



f517.nt 

ATGATTCCTGTTGTTGCAAGTGGAGGAATTTTAATTGCTC 

OTAATTTTGCTGAGCATCCATTTTATAAGCAGATTGCAGATATT^ 

GCTTGCTGGTTTTATTGCAATGGCAATTGC 

GGGAATGTAAAAGCAGGTTTCTTGGGCGCAATATTTGCGGG^^ 

GAAGATCTGTTCCTGAGTGGTTAAGACCTGTAATGCCTATATTTGTAATTC 

CTTTTTTATGCTGTATTTTGGTGT^ 

AGTAATTCGGAAACTTTTGGCGTGTTGGGTAAAATTO 

TGGGCGGACCTTTTAATAAAGTGGCATTTCTTTTTTC 

GGTAGCAGCAGCAATTCCTGTTCCrcOTATGGCTATGGQ 

GAAGAAAAAGAATCTGGTAAAATAGCCTTTTTAATTTC 

CTAGTGATCCCGGACGGGTAATCCCTTCGATAGTGGTAGGGGGAGCTGTATCAAGCAT^ 

CGTOXTTAATCATGCTCCACACGGAGGACCAATAGTACTTCCTGTTATTGAT 

ATTGCTGTTGGAGTTGCGGTTGCAACAGCTTTG 

t517.nt 

GATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGM 

TATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGG 

AATGCCTATATTTGTAATTCCGCTAATAAGCAC 

GGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACA 

AAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTC 

TTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATA^ 

GCTATGGGGCTTGCAACCTTTTTAGCACCT 

TAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCT^ 

AGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGGCG 

ATAGTACTTCCTGTTATTGATAATAAATTTGGGTTT^ 

TGGTAATTTTTTTGAAATCTTTAAAATTAAAG 

f519.aa 

MIKIFKKIYILTLVLGMAHLSFASDNYKVRCSKEEDSTTCIAKLKEIKEKKNYDLFSM 
NIDFGYGGFIGLKSNNFENYLNGGIimFKKQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETO 
IMEYNFAKNLS IGFKI YPAVGPTILLTKPS I LFEGI KFNFFGFGFIKFAFN 

t519.aa 

DNYMVRCSKEEDSTTCIAKLKEIKEKKNYDLFSMGIGIGDPIANIMITIPYINIDFGYGGFIGLKSNN^ 
IDVIFKKQIGQYMKIGGGIGIGADWSKTSLI PPNEEEETDYERIGAVIRI PFIMEYNFAKNLSIGFKIYPAVGPTI 
LLTKPSILFEGIKFNFFGFGFIKFAFN 
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TABLE 1. Nucleotide and Amino Acid Sequences 
f 519. nt 

ATGATAAAAATTTTTAAAAAAATATAC^ 

ATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACC^^ 

AAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTA 

AATATTGATTTTGGATATGGAGGTTTTATTGGCCTT 

ACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTG^ 

AAC^TCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGCTO 

ATAATGGAATATAATTTTGCAAAAAATTTATCC^^ 

TAACAAAACCAAGCATTTTATTTGAAGGAATTAAA 

TTAA 

t519.nt 

GACAATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAA^ 
AGAAAAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATC 
TATAAATATTGATTTTGGATATGGAGGTT^ 
ATAGACGTTATTTTTAAAAAGCAAATTGGACAATA 

CAAAAACATCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGC 

TTTTATAATGGAATATAATTTTGCAAAAAATTTATCCATAGGA 

TTACTAACAAAACCAAGCATTTTATTTGAAGGAATT 

TTAATTAA 

f520.aa 

MRMLI^TIILILTTGLLAAQSKSKSMTETO 

VGLKPNNFLPYVVMGVDLLFKDEIHKNTMISGGIGIGADWSKGSPEKSNEK^ 
PLVIEYSFLKNIVIGFKAVATIGTTMLIX3SPMSFEGARFNFLGTGFIKIYI 

t520.aa 

QSKSKSMTEDDFDFDKLLAKEESVRRLFGIGFGVGYPLANITISVPYVDIDtX^ 

FKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEBENEAQQVASLQNRIGWIRLPLVIE^^ 

ATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

f520.nt 

ATGAGAATGCTATTAGCAACAATAATACTTATATTAACAACGGGTTT 

TGACTGAAGATGACTTTGATTTTGATAAACT^ 

TGGAGTTGGATATCCACTTGCAAACATTACAATATC 

GTAGGGCTTAAACCCTUVCAATTTCTTGCCCTATGTTGTGAT^ 

AAAACACTATGATTTCTGGAGGCATTGGAATAGGTGC 

ACTTGAAGAAGAGGAAGAAAATGAAGCACAAC^GTAGC^ 

CCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTC 

TGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAA 

ATAG 

t520.nt 

CAATCCAAAAGCAAAAGTATGACTGAAGATGACTOT 

GTTTATTTGGCATAGGTTTTGGAGTTGGATATCC^ 

CCTTGGGTACGGAGGATTCGTAGGGCTTAAACCCAACAATTTCTO 

TTTAAAGATGAAATACACAAAAACACTATGATTTCTC 

CTGAAAAATCAAATGAAAAACTTGAAGAAGAGGAAGAAAATG^ 

AGGGGTTGTGATAAGATTGCCTTTGGTAATAGAGTAC 



WO 98/59071 



PCT/US98/12718 



91 

TABLE 1 . Nucleotide and Amino Acid Sequences 

GCTACTATTGGAACAACTATGCTACTT^ 
GCTTTATAAAAATATATATATAG 

f523.aa 

MNIKINFFFTLPIGIFI^LFFPI^IYSSI^HAFIRLSYL^ 

NLSGVAVS I IAATI YLPQRI PILEKTIQNTCFFEKEALLETFFPKNIFKIFTSSNPNLLSIYMISII IGTSFYYAK 
QKGRIARELMLSASNLFYHANGFIWII^IGIIFITANYAANLKNFKDYPNYTNSITF 

RLTKSFKMI YKGI FVS FQNI I FSGLAKDS YS PYVT L IEDI KNERINI KKS 1 1 INI PL INFVSKFGTI FVSVI SFFI 

ILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIELNVSNITPMLPILISLA^ 

NIAIIHIINFKELKDQEKIN 

t523.aa 

I ENIIENKNFKKLFGKTIYYGILTNLSGVAVSI IAATI YLPQRI^ 

SSNPNLLS I YMI S 1 1 IGTSFYYAKQKGRIARELMLSASNLFYHANGF IVNILNIGI I FITANYAANLKNFKDYPNY 
TNSITFFLAWTIIILFVILPTISYRLTKSFKMIYKGiro 

INIPLINFVSKFGTIFVSVISFFIIIJCSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMI/:STYra 
LNVSNITPMLPILISLALLIDFAFNIAIIHIINFKELKDQEKIN 

f523.nt 

ATGAATATAAAAATCAATTTTTTTTTCACTTTGCC 

ATAGCTCCTTATCACATGCTTTTATAAGATTATCATACTTATC 

AGGAATTGAAAATATTATTGAAAATAAAAACTTTAA 

AACCTATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATC 

CAATACAAAATACATGTTTTTTTGAAAAAGAAGCTTO^ 

TACATCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCA 

CAAAAAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCT^ 

ACATATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAA 

TTATACAAACAGCATAACATTCTTTTTGGCATGGA 

AGATTAACAAAAAGTTTTAAAATGATATATAAAGGCATOT 

CAAAAGATTCTTATTCCCCTTATGTGATATTAATAGAAGATAT^ 

AATTATAAACATACCTTTAATAAATTT^ 

ATTTTAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATA^ 
TCTTTGCATTTCCTC^TATACCAAATAGTTTAAT^ 
AGAGCTAAATGTTTCAAACATAACACCAATGCTGCCGATA 
AACATTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAG^ 

f523.nt 

ATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTT^ 

TATCTGGAGTTGCTGTATCAATAATAGCTGCAACAAT 

ACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAG 

TCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAAT 

AAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATC 

ATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCA 

ACAAACAGCATAACATTCTTTTTGGCATGGACAATTAT^ 

TAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTAT^ 

AGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATO 

ATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCA 

TAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATG^ 

TGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTAC^ 

CTAAATGTTTCAAACATAACACCAATGCTGCCGATA 

TTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAA^ 
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TABLE 1. Nucleotide and Amino Acid Sequences 
f526.aa 

MKKEFIMLLLLLQT IMNLNS INTNTSTSIVKELQKNLYI FNSKEYQKDKDTLNEFINS ININDKEI LQSLEKIKNE 

LFIISWFNNKKGILIALNIjGAEINFKYKISPISISIINNEFEITKILIDYGISLNQI 

IFEFLKESGADLSFTLKNRKTPMQAAIETENIKLIKSLEKKKIYIDDNFIOCKLKKLKNKEIVRILVK 

t526.aa 

NSINTNTSTSIVKELQIOILYIFNSKEYQKDKiyrLNEFINSININDKEI^^ 

NLGAEINFKYKISPISISIINNEFEITKILIDYGISLNQ^ 

RKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

f526.nt 

ATGAAAAAAGAATTCATTATGCTTTTACTGTTATTGCAAAC 

GTACTTCAATAGTAAAAGAATTGCAAAAAAATTTA 

TTTAAATGAATTTATAAATTCAATAAATATAAATG^ 

CTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGG^ 

TTAAATATAAAATATCTCCAATTTGAATTTCA 

CGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCC^ 

ATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTC 

CAATAGAAAGAGAAAATATAAAACTAATTAAATCTCTGGAAAA 

AAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAATAG 

t526.nt 



AATATO^AAAAGATAAAGACACTTTAAATGAATTTAT^ 

TTTAGAAAAAATCAAAAATGAGCTTTTTATAATATCT 

AATCTTGGAGCAGAAATAAACTTTAAATATAAAATATCTCCAATT^ 

TCACAAAAATATTGATAGATTACGGAATAAGCCTTAATCAAATAGATGATACAGGOT 

AATATATACTAATAACGAAAAAATATTTGAATTTTTAAAAGAAAGCG^ 

AGAAAAACACCAATGCAAGCCGCAATAGAAACAGAAAATATAAAACTAATTAAATC 

AC^TTGACX^C^TTTCAAAAAAAAACTTAAAAAGCTT 

f544.aa 

MTKNRI IWLLVLMVS STFTATI I SNYQNLMLSLVVLANFI PLLMDTSGNAGSQASALI IRELALGTVKVKDFFKVF 

LKEIOTSILVGAILASVNFLRIWFVAPHHSDKLKIAFWSSCLMVSLWAK 

ITTIADAI TL I AYFNIAKWVLVS YAV 

t544.aa 

STFTATI I SNYQNLMLSLWLANFI PLLMDTSGNAGSQASALI IRELAIiGTVKVKDFFKVFIjKEICVS I LVGAI LA 

SVNFIJIIVFFVAPHHSDKI^IAFWSSCLMVSLWAK^ 

IAKWVLVSYAV 

f544.nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTG 

ATCAAAATTTAATGTTGTCTTTAGTGGTTTT 

CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTC^ 

TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTC 

TAGCTCCACACCATTCTOATAAGCTGAAAATAGCTTTTG 

AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTO 
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ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTO 
TTTAA 

t544.nt 

TCTACTTTTACAGCTACAATTATTTCAAATTA^ 

CCCTTTTAATGGATACTTCAGGCAATGCCGGCTCT^ 

TGTCAAGGTAAAAGATTTTTTTAAAGTGTTTTTAAAGGAAAT 

AGTGTOAATTTTTTAAGAATTGTCTTTTTTC 

CATCTTGCTTGATGGTAAGTTTGACAGTAGCAAAGATATO 

GTTGGATCCAGCACTTATGGCAGGCCCTTTAATC^ 

ATAGCAAAATGGGTTTTAGTTAGCTATGCTGTTTAA 

f545.aa 

MTKNRI IWLLVLMVS STFTATI I SNYQNLMLSLWLANFI PLLMDTSGNAGSQASALI IRELALGTVKVKDFFKVF 

IJCEICVSILVGAILASVNFLRIWFVAPHHSDKLKIAFWSSCLIW 

ITTIADAITLIAYFNIAKWVLVSYAV 

t545.aa 

GSQASALIIRELALGTVKVIODFFKVFLKEICVSILV^ 

AKIIX3GLLPIVAKLLKLDPALMAGPLITTIADAITLIAYFNIAKWVLVSYAV 
f545.nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTO 

ATCAAAATTTAATGTTGTCTTTAGTGG 

CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGC 

TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTC 

TAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTC 

AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAAC 

ATCACTACAATTGCAGATGCTATTACTTTAATA 

TTTAA 

t545.nt 

GGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGC^ 

TTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGC^ 

TGTAGCTCCACACCATTCTGATAAGCTGAAAATA 

GCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTA 

TAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTAT^ 

TGTTTAA 

f577.aa 

MRIKNLILIAILLISPSCSTNKNIVVLTD 
TNIANHFKSVKINYNPDYPILKHIFKQFNYKIIPLGFDIP 
YVSENLFYVISQINNVRFSFEKNKLNYNEN^ 
GLSDITFYNSLSEQEKSQIKTSYLINDNNEIVI^ 

FGFANGFTPYKELNLKIKHSIDGI SPFI IDETQINSHSYVLSKKTIEKENLLINEWFFSKANNLKKNKN 
t577.aa 

NKNIVVLTDNKTIPFYINQFNIENKANFIIKFRNNIDLQTIEKENAQIIISKNIGN^ 
LKHIFKQFNYKIIPLGFDIPILIYKNTHHIKKYINTKYLKEEYENFIKIX3KF 
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EKNKLNYNENQILKMLEYFSSFLNTO 
FSYLINDNNEIVISNPNFIGILETSVLTO^ 

IIX5I SPFIIDETQINSH£TSfVLSKKTIEKENLLINEWFFSKANNLKKNKN 
f577.nt 

ATGAGAATAAAAAATTTAATACTAATAGCAATTTTAOT 

TACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAG 

GTTTAGAAATAATATTGATCTGCAAACAATAGAA 

ACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATC 

AGCAATTTAACTACAAAATTATTCCATTGGGCTTTGAC^ 

AAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAA 

TATGTTTCTGAAAATTTATTTTATGTC 

ATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATC 

AAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAA 

GGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAAT 

ACX3ATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTG^ 

TATCAACTGGATATTGTATAAAAAAACTCAAAAAACCCTAATTGGAOT 

TTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTA 

CTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATT 

ACTAATAAATGAATGGTTTTTCTCTAAAGCTA^ 

t577.nt 

AATAAGAACATCX5TTGTACTAACTGACAATAAAACAATAC^ 

CAAATTTTATAATTAAGTTTAGAAATAATATT^ 

TAAAAAGATTGGTAACACAAATATTGCTAACCATTTTAAATCTC 

TTAAAGCATATTTTCAAGCAATTTAACTACAAAATTATTCCATTG 

ATACACATGATATTAAAAAATACATAAACACTAAATATC^ 

ATTTTTTATATCGCCTTATGTTTCTG 

GAAAAAAATAAATTAAATTATAATGAGAATCAAATTTTAAAAATGOT 

AACAAATGGACTTGCAAAAAGATTTCTTTAATAAATACGGCTACCTAAAGOT 

ATCTCTTTTAATAGCAGGATTGAGCGATATAACCTTCTACAATAGCTO 

TTTTCXTTATTTAATAAACGATAAGAAT^ 

TTTTAACTAAAAAATTTATCAACTGGATAT^ 

CCAATCAAATATATGTTTTGGATTTGCX!^ 

ATTGATGGAATATCTCCTTTTATTATTGACGAAA 

TTGAAAAAGAAAACTTACTAATAAATGAATGGTTTTTC^ 

f584.aa 

MIKTILLLVLYPWVFSQI SANQYFEGI YAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFI INLDSNNQVFVS 
IXSEFLTVYVPSI^TSFNQQLLKGSSGGGLMKVL^ 

FIIAFAPDGIIRRITAFPTSGGREIVIDLTAVKFNVGIIiDSKFKYDPPKSSNKVDNFLTO 



t584.aa 

QISANQYFEGIYAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFIINLDSNNQW 
QQUuKGSSGGGLMKVLNSEYSVSYTO 

PTSGGREIVIDLTAVKFWGIIJ)SKFKYDPPKSSNKVI»JFLYDIKKN 
f584.nt 

ATGATAAAAACAATACTTTTATTAGTTTTGT 
AAGGAATTTATGCTAAATATCAAAATATAGAGGACATG 
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AACAGGTGTTTTGCTTTATAAGTTTCC^ 
GATGGTGAATTTTTGACAGTTTATGTTCCATC 
GGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATC 
TTCATCTGAGCCTGGAAAATATATTAAATTAA 

TTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGC 
TTATTGATTTGACTGCTGTGAAGTTT 

AAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 
t584.nt 

CAAATATCTGCAAATCAATATTTTGAAGGAATTTATGCTAAATATC 

ATTTTACTTTAAAGGGGTTAAAGCAAACAGGTGTTTTG^ 

TTCAAATAATCAAGTTTTTGTAAGTGATGGTGAAT^ 

CAGCAATTATTAAAGGGTAGTAGTGGGGGAGGTCTTATGAAAGTTTTAA^ 

ATTCTCCAAATTTAGAAGATCTCGATTCATCTGAGC^ 

CAAGGGGGCTGCTACTATTAATTCTTTTATTATTGOT^ 

CCTACTAGTGGTGGGCGCGAAATAGTTATTGATTTGACTGCTGTC 

TTAAATATGATCCrrcCAAAATCTTCAAATAAGGTAGATAATT^ 

f596.aa 

MKERCLYLLVFVALCVNNLFSDDYLIYDFDLSLNEFLE^ 
YSKKYLFKKNEHGVFFVKVNIPHGTSSIKYRLIVIXSVWTra 
SYDNNEIEIFYIGRPGQIVTIAGSFNNFNPFLNRLIEKEDNKGIYTIKLKN^ 
INLYFVEG I DNKI DFEVS YFDHK 

t596.aa 

DDYLI YDFDLSLNEFLEVSTRKDNLEPMVDSNRI LLFYPPKKEIRKI FAAFDFDQYSKKYLFKKNEHGVFFVKVNI 

PHGTSSIKYRLIVIXSVWTNDEYNKNVVTO 

AGSFNNFNPFLNRLIEKEDNKGIYTIKLKNIjP 

HK 

f596.nt 

ATGAAAGAAAGGTGTTTGTATTTATTGGTTTTTGTAGCT^ 

TTTATGACTTTGATTTAAGOTT^ 

TTCCAATCGTATATTATTGTTTTATCCTC 

TATTCTAAGAAATATTTATTCAAAAAAAATGAGCATGGAG1 

GCAGTATAAAATATAGGCTTATTGTAGACGGTGTTT 

GGATTTAATCCCATTTTCTAAAATTGAGATCX3CTAA 

TCATATGATAACAATGAAATTGAAATTTTTTAC^ 

AC^^TTTTAATCCTTTTTTAAATAGGCTT^ 

TTTACCCAAGGATAGAATTTATTATTATTTTATTGATT^ 

ATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCG^ 

t596.nt 

GATGATTATTTAATTTATGACTTTGATTTAAGTTT 
AGCCTATGGTTGATTCCAATCGTATATTATTGTTTTATC 
TGACTTTGATCAGTATTCTAAGAAATATTTATTCAAAAAAA^ 
CCTCATGGCACAAGCAGTATAAAATATAGGC 

TAGTTTATAATGAGGATTTAATCCCATTTTCTAAAATTGAGATC^ 
AAATCCAATAGAATGATATGATAACAATGAAATTC 
GCTGGTAGTTTTAACAATTTTAATCCTTT^ 
TTAAGCTTAAAAATTTACCCAAGGATAGAATTTATTATTATTTO 
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TAATGTTAATAGAATTAATTTAT 
CATAAGTAA 

f598.aa 

MRQRVMIAMALSCHPSLLIADEPTTALDVTIQEQIL^ 

EEGTVEEIFNNPKHPYTIGLLKSILTLEHDPNKKLYSTKENPMKITKTSTEEF 
t598.aa 

EPTTALDVTIQEQILLLIKNLSKKFNTST^ 
KSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

f598.nt 

ATGAGACAAAGAGTTATGATTGCCATGGCTCTTAGCTGTCA 

CCCTTGATGTTACAATCCAAGAGKIAAATATO 

ATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTO 

GAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACA 

CGCTAGAACACGATCCAAATAAAAAGCTTTATTCAAC^ 

GGAGTTTTAA 

t598.nt 

GAACCAACAACAGCCCTTGATGTTACAATCCA^ 

ATACTTCTACCATATTTATAACTCATGATCTTGCGGTTGTTG^ 

AGGAAAAATTGTAGAAGAAGGAACAGTAGAGGAAATATTTAACAA 

AAATCAATTCTTACX5CTAGAACACGATCCA 

AAACCAGCACCGAGGAGTTTTAA 

f6O0.aa 

MAIMERSIIGLFIALAFVSWLTVARVV^ 

FIMAEAFLSFLGLGI SAPMTSWGELVQNGI ATFVEYPWKVFI PAIVMTI FLLFMNFLGDGLRDAFDPKDS I 



t600.aa 

RWRGQVQSLSSSEFIQAAKTLGATNQRI ILKHLI PNSIGMIVIFTTIRVPSFIMAEAFLSFI/JI^ISAPMTSWGE 
LVQNGIATFVEYPWKVF I PAIVMT I F LLFMNFLGDGLRDAFDPKDS I 

f600.nt 

ATGGCAATAATGGAAAGAAGTATAATCGGCTTATTCAT^ 

TTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAG^ 

AAGAATAATCTTT^AAACACTTGATCCCTAATAGCATTGGAATGATAGrTATA 

TTTATTATGGCTGAAGCATTTTTATCCTTTTTA 

TGCAAAATGGAATTGCTACATTTGTTGAATA 

ATTATTTATGAACTTTTTACX5TGATGGGCT 

t600.nt 

CX3AGTTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTT 

ATCAAAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTO 

AAGCTTTATTATGGCTGAAGCATTTTTATCCTTTTTAG 

\ 
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TTAGTGCAAAATGGAATTGCTACATTTGTTGAATATCCA 
TTCTATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCT^ 

f603.aa 

MLKFTLKKI LGI I PTLLVI I FLCFFVMRMAPGSPFDSEKPIDPQVKARIJ£EKYHLDK^ 

PSLKKKDLTVSQYIKLGFPKSLTLGVISLIISLSIG^^ 

FFAIKWGLLYTSGWITERGGFSNLILPIITLSM^ 

GAMLPWSYIGPAFAAI ISGSWIEKIFRIAGMGMFITESALNRDYPVLMGGLLVYSIILLI SILI SDI IYKILDP 
RV 

t603.aa 

SPFDSEKPIDPQVKARI^KYHIJJKPFYIQAFYYITNAI^GDLGPSL 
LSIGIPIGII^IYKNTYVDYIITSIAIUSISIPLFVIGP 

MPNVAI FARI IRGSMLEI IQSDFIRTARAKGLSFKKIVIKHMLRGAMLP WSYIGPAFAAI I SGSWIEKI FRIAG 
MGMFITESALNRDYPVLMGGLLVYS 1 1 LLX S I LI SDI I YKILDPRV 

f603.nt 

ATGTTAAAGTTTACTTTAAAGAAAATATTAGGAATAATACCAACTTTACTGG 

TAATGAGAATGGCTCCTGGAAGTCCATTTGATTCTGAAAAACC 

AAAATATCACCTTGACAAGCCTTTTTATATTC 

CCTTCTTTGAAAAAGAAAGACCTTACAGTTAGTC^ 

TAATATCCCTTATTATATCACTATCAATAG^ 

GGATTATATAATAACATCAATAGCAATATTGGGGATTTCAATACCATO 

TTTTTTGCAATTAAATGGGGTTTGCTCT 

TACCCATAATAACTCTTAGCATGCCCAA^ 

ACAAAGCGACTTTATAAGAACTGCGCGTGCAAAAGGGCTAAGCTTC 

GGAGCAATGTTGCCTGTAGTAAGCTATATAGGTCCAGCATTTGCTGCTAT 

AAATATTTAGAATTGCTGGAATGGGAATGTTTATAACAGAATC 

CGGATTGTTAGTATATTCAATAATACTGCTTATTTCTATATTAATATC 

AGAGTATAA 

t603.nt 

AGTCC^TTTGATTCTGAAAAACCTATTGATC 

CTTTTTATATTCAAGCTTTTTAOT 

CCTTACAGTTAGTCAATACATAAAATTAGGAT^ 

CTATCAATAGGAATACCAATAGGTATATTAGC 

TAGCAATATTGGGGATTTCAATACGATTATTC 

TTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGG^ 

ATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCT 

CTGCGCGTGCAAAAGGGCTAAGCTTCAAAA^ 

AAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGT^ 

ATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTA1T 

TAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTA^ 

f607.aa 

MKYIKIALMLIIFSLIACISNAKKEKrTFRVSNLSEPSSLDPQLSTO^ 
KSWNISEDGIIYTFNLREDIWSDGVAITAEEIKKSYLRII^^ 

AIDSKTLEITLTSPKPYFPDMLTHSAYIPVTlffilVEKYGENWTNPENIWSGAYKLKERSINDKIVIE 
KNVEIDEVIFYPTEGSVAYNMYINGELDFLQGAEKNNLEEIKIRDDYYSGLK^ 

SLAIDRETLTKVVLKGSSDPTRNLTPKFDDYSYGKNLILFDPENAKKLLAEAGYPIX^ PTLKYKI SEGRPTTAE 
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FLQEQFKKILNINLEIENEEWTTFI^ 

KKSNFELDPIKRQDILRQAEEI IAEKDFPMAPLYI PKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 
t607.aa 

CISNAKKEKIVFRVSNLSEPSSLDPQLSTDLYGSNI ITNLFLGIJVVKDSQTGKYKPGLAKSWNISEDGI IYTFNLR 
EDIWSIX^AITAEEIKKSYLRILNKKTAAMYANLIKSTIKNAQEY 

FPDMLTHSAYI PVPMH I VEKYGENWTNPENIVVSGAYKLKERS INDKIVI EKNEKYYNAKNVEIDEVI F YPTEGSV 
AYI^INGELDFLQGAEKNNLEEIKIRDDYYSGLKNGMAYIAFNT^ 

OTPTRNLTPKFDDYSYGKNLI LroPENAKKLLAEAGYPDGKGFPTLKYKI SEGRPTTAEFLQEQFKKILNINLEIE 

NEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFUDSLFTTENHFI^ 

QAEEI IAEKDFPMAPLYI PKSHYLFTWDKWTGWVPNI AESYLYEDIKTKK 

f607.nt 

ATGAAATATATAAAAATAGCCTTAATGCTAATAAT^ 

AAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCAT^ 

CAACATTATTACAAACCTATTCTTAGGCCT^^ 

AAAACTTGGAATATTTCTGAAGATGGAATTATTTACAC^ 

TTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTA 

TTTAATAAAATCTACAATAAAAAATGCACAAGAATO 

GCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGC 

CATACATACCAGTTCCAATGCATATTGTTGAAAAATATGG 

TGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGAAA 

AAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGT^ 

AACTCGATTTTCTACAAGGAGCAGAAAAGAAT^ 

AAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGACAATTT 

TCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGG 

GAAAATTTGATGACTATTCTTATGGAAA 

AGCTGGATATCCXX5ATGGGAAAGGATTCCCCACATTAAAATATAAAATATC 
TTTTTGCAAGAACAATTTAAAAAAATACTAAACATTA^ 

GAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAGACT 

AGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGC 

AAAAAATCTAATTTTGAACTTGATCCAA 

AAGACTTTCCTATGGCA-CCTTTATATATACCCAAATCT^ 

ACCAAATATCGCAGAAAGCTATTTATATGAAGATATTAAAACTAAAA^ 

t607 .nt 

TGTATTAGTAATGCTAAAAAAGAAAAAATAGTTTTCAGAGTATCA 

AACTCTCAACAGACCTTTACGGTAGCAACATTATTACAAACCTATTC 

TGGAAAATATAAACCAGGACTTGCAAAAAGTTGGAATATTTCTGAA 

GAAGATATAGTTTGGAGCGATGGAGTTGCCATTACTGCCGAGGAGATAAAA 

AAAAAACAGCTGCAATGTATGCTAATTTAATAAAATC^ 

GCCTGAATCTGAGCTTGGCATAAAGGCTATTGACAGCJy^AACC 

TTTCCTGATATGCTAACACACTCAGCATACATACCA 

CAAATCCTGAAAATATAGTTGTTAGTGGCGCATACAAACTTAAAG 

AAAAAATGAAAAATAOTATAATGCAAAAAATGTAGAAATTGATGAAGT 

GCTTACAATATGTACATAAACGGTGAACTCGATT^ 

TAAGAGATGATTATTATTCTGGGTTAAAAAACGGA^ 

CAATTTAAAAGTTAGACAAGCCATCTCCCTTGCCA 

TCAGATCCAACAAGAAATCTAACTCCAAAATTTGATGATTA 

AGAATGCAAAAAAACTTTTAGCTGAAGCTGGATATCCGGATG 

GGAGGGAAGACC^CAACAGCAGAATTTTTGCAAGAACAATTTAAAA 

AATGAAGAATGGACAACATTCCTAGGAAGCAGAAGAACTGG 

ATTATTTTGATCCCTTAACATTCTTAGACAGCTTATTTACAACAG 
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AAACAAAGAGTATGATGCTTTAATAAAAAAATCT 

CAAGCTGAAGAGATAATAGCAGAAAAAGACTTTCCTATGGCACCTTTATATATACC 

GAAATGATAAATGGACAGGGTGGGTACCAAATATCGCAGAAAGCTATTC 

A 

f611.aa 

MKKIFLFLFISFYLFGFEDSSLKIGIDDVYVEAHEEGFHLFIRKKPAIKSVILTESFEIPDKKKDVATYSFRTLSY 
NKVNGDEIRI LNGRVIKNKELLSLTSSTPVPNKKFGEAFHILI PKKLKYGF PNFSTRSGDIDLEVLKSKKEPFWFS 
IRSFEKKYNDYLGRYQDNAYELLFKDIX^NQGKIEFNELKDTFTKFSDEW 

DLVLWDVTDSMKSNI EI LKEHLF S 1 1 EPQLQKFKS YRIGLVFYKDYLEDFLTKAFDFNT I PYLNNI LKYVNVGGG 
GDYPEAVFEGIDAAVTQFDWRAERRFI IVIGDAPPHEYPRGS IVYKDVINSAKEKDITI YGI IFQ 

t611.aa 

FEDSSLKIGIDDVYVEAHEEGFHLF IRKKPAIKSVI LTESFEI PDKKKDVATYSFRTLSYNKVNGDEIRILNGRVI 
KNKELLSLTSSTPVPNKKFGEAFHILI PKKLKYGFPNFSTRSGDIDLEVLKSKKEPF^^S IRSFEKKYNDYLGRYQ 
DNAYEI^FKDDQNQGKIEFNELKDTFTKFSDEVVIAJ^ 

EI LKEHLFS 1 1 EPQLQKFKS YRIGLVFYKDYLEDFLTKAFDFNTI PYLNNI LKYVNVGGGGDYPEAVFEG I DAAVT 
QFDWRAERRF I IVIGDAPPHEYPRGS IVYKDVINSAKEKDITI YGI IFQ 

f 611. lit 

ATGAAGAAAATTTTTTTATTTCTTTTO 

TTGATGATGTTTATGTTGAGGCTCATGJ^ 

AATATTGACAGAGTCTTTTGAAATTCCTGATAAGA 

AATAAGGTTAATGGAGATGAAATTCGGATTTTAAATGGAAGAGTTA 

CTTCCACCCCTGTTCCTAATAAAAAGTTTGGAGAAGCT^ 

TCCAAATTTTTCAACAAGAAGTGGTGATATTGACT^ 

ATAAGATCTTTTGAGAAAAAATATAATGATTA 

ATGATCAAAATCAGGGAAAAATTGAATTTAATGAATTAAA 

TGCTAATAATGGCATTGATATTGTTGATAAAATAAACAAAATT^ 

GATTTAGTGCTTGTTGTTGATGTTACTGATAGTATGAAAAG 

TAATAGAACCTCAACTTCAAAAGTTTAAATCCTACAG^ 

TTTAACCAAAGCTTTTGATTTTAATACTATTCOT 

GGGGATTATCCAGAAGCTGTTTTTGAGGGGATTC 

TTATTATTGTTATAGGAGATGCACCTCCTCATGAGTATCCAAGAGGGTCT 
TGCAAAGGAAAAAGATATTACAATTTATGGAATAATATTT^ 

t611.nt 

TTTGAAGATAGTTCTTTGAAAATAGGTATTGATGATC 

TTAGAAAAAAACCTGCAATCAAATCAGTAATATTGACAG^ 

TACTTATTCATTTCGTACATTAAGTTATAATAAGGTTA^ 

AAGAATAAAGAACTTTTATCATTGACATCTTCCACCCCTGTTCCTAA 

TGATTCCAAAAAAATTAAAATATGGATTTCCAAATTTTTC 

AAGTAAAAAAGAGCCCTTTTGGTTTTCTATAAGATC 

GACAATGCTTATGAATTGCTTTTCAAGGATGAT^ 

TTACAAAATTTTCAGATGAGGTTGTTATTGCTAATAATGGC^ 

AAACTGAGAAGATTCAGTTTATGATTTAGATTTAGTGC^ 

GAGATTCTAAAAGAGCATTTGTTTTCAATAATAGAACCTCAA 

TTTTTTATAAAGACTATCTTGAAGATTTTT^ 

TCTTAAGTATGTTAATGTTGGTGGCGGTGGGGATTATCCAG 

CAATTTGATTGGCGGGCAGAAAGAAGGTTTATTATTGTT 

CTATTGTTTATAAAGATGTTATCAATTCTGCAAAGGAAAAAGATAOT 
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f617.aa 

MIFFRNSFMALIFSFSILSISYFFGDFFQFSYIKMISWR 

FSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKAL^ 

NSLFALDSSLIYLFLLGVSVWLFYWIHKKTIYGLQ^ 

VVFFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFIiNINTO 
IKARKK 

t617.aa 

AKSNSLNLGNEGQIYFGAFLVYIFSSFFGLTYFNFVFLILLSSFFVGLI/5LIPFFI 
RLVIXSFILNMLKTGSFSNQTKRIN^ 

EFKYI^FAVFGSAFI^GLAGSMFVVFFRPYLVLGLTSGLGWS SLIVAVT SGFNYVYVLFFSLLFSI LIEFNNFLNI 
NYDFKYEFIGLCQSIAIFISLFLIKARKK 

f617.nt 

ATGATCTTTTTTAGAAATAGCTTTATGGCATTAATTTTO 

ATTTTTTTCAATTTTCTTATATTAA^ 

TACTTGTGCCAAGAGTAATTCATTAAATCTTC 

TTTTCAAGTTTTTTTGGATTAACCTATTTO 

TGGGGCTTATCCCCTTTTTTATTACTTTTTTC 

AAATCAAAGATTGGTGGATGGATTTATTTTAAATATGTTAAAAAC 

AATAGTTTGTTTGCTTTAGATTCATCACTTATTTACTTO 

TTATTCACAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAA 

TATAAATGAATTTAAATATAAGTTTTTCGCTGTATTT^ 

GTAGTGTTTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGAC 

TTTCAGGATTTAATTATGTTTATGTATTATTTTTO 

TAATATAAATTATGACTTTAAGTATGAATTTATTGGGCTTTGTC 

ATTAAAGCTAGGAAAAAGTAG 

t617.nt 

GCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCA 

GTTTTTTTGGATTAACCTATTTTAATTTTGTA 

TATCCCCTTTTTTATTACTTTTTTC 

AGATTGGTGGATGGATTTATTTTAAATATC 

TGTTTGCTTTAGATTCATCACTTATTTACT^ 

CAAAAAAACTATTTATGGTCTTCAGCTTGAAAT^ 

GAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTG 

TTTTTAGACCATATTTGGTTTTAGGGCTAACTTC^ 

ATTTAATTATGTTTATGTATTATTTTTTAGCTTATTC 

AATTATGACTTTAAGTATGAATTTATTGGGCTTTGTC 

CTAGGAAAAAGTAG 

£631. aa 

MVVEINSIJITCYLLVLLLLVAYGLVVFYTSSFFLSLELTGNPNFLFFTO 

PVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISYWIKPMLIFAITO 
VLIILQNDYSTAIYFAILFFIVLFVSNM^ 

ASLNAXjKSGG I LGKGLGMGEVKLGKLPEANSDFI FSVLGEELGFLGVLFAI SLFFLFFYFGYFI AI HSNSRFKFFI 
AFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 



t631.aa 
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RISLNFLKKSIFPVLIITLFLIMATFLSPSISGAKRWIFFQGV^ 

WIKPMLIFAIFWVLIILQNDYSTAIYFAILFFIVLFVSN^ 

YDDPSGKGYQIIASLNALKSGGII^KGLGMGEVKLGKLPEANSDFIFSVLGE 

AIHSNSRFKFFIAFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 
f631.nt 

ATGGTTGTAGAGATAAATTCACTTAGGAC^TGTTATTTGCTTC 

TTTrTTATACTTCTTCCTTTTTTCTAA 

TTATCTTTTTTTAAGTTTTATGGTTTTTCT^ 

CCTGTATTGATTATAACTCTTTTTTT^ 

TATTCTTTCAAGGTGTTAGCATTCAACCTrc 

GAGGAAGTTTGACCCAAGAAAAAACAATGGTAOT 

GTGTTAATAATTTTGCAAAACGATTATTCA^ 

CTAATATGGCATTTAGCTATGTTTTTGCTATTGTG^ 

ACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATC^ 

GCATCrcTTAATGCTTTAAAAAGTGGAGGAATTTTAG 

TACCAGAGGCCAATTCGGATTTTATTTTTT^ 

AAGCTTG'l w l M X H ri > TTTGri u lTTTTACTTTGGTTAT^ 

GCATTTATTTCAAGTCTTGCAATTTTTCT^ 

CAGGGATAAATTTACCATTTTTTTCATCTGGGGGATC 

AAATGTTTCAAAAAATTTAAGTAATAATTGA 

t631.nt 

AGGATTTCTTTAAATTTTTTAAAAAAATCAATATTTC 

TTTTATCTCCAAGTATTTCTGGAGCA 

TAAAATATCTTTTACTATTTATCTTTGAGCTTATTTC 

TGGATAAAGCCAATGTTGATTTTTGCAATTTT^ 

ATTTTGCCATTCTTTTTTTTATTGTTTTC 

TTTTTTACCAGTTTCTGCTATATTCTTG^ 

TACGATGATCCTTCTGGCAAAGGTTACCAGAT^ 

AAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAATTACCAGAG 

AGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTAT 

GCTATTGATTCTAATAGTAGGTTTAAATTO 

TGAATATTTTAATTGCAATCGGTCTTTT^ 

TATTATTGTTACCATGGCATTGTCTGGCCTTATTTCA^ 

f647.aa 

MKVNNFLSFFFRAFFLLFLIVI LFFFVLFFIDFIGMYNTKRYFPEFVRTKLI/5ETSLVFDHNSNI I LDEARLVKER 
EAIDIKNQQIEIOiKEDLKLKEDSLN^ 

RLEDLNPEIAISYMRKIEELSKKEGRLSIVPYWIiSLMDSKKAAILIRKM 
t647.aa 

IDFIGMYNTKRYFPEFVRTKLIX3ETSLVFDHNSNIIIJDEARLVKEREAIDIKNQQI 

LKQKQKDLDLKQKI IDD I INKYNDEEANI LQTAVYLMNMPPEDAVKRLEDLNPELAI SYMRK I EELSKKEGRLS IV 
PYWLSLMDSKKAAI LIRKMSVS SLE 

f647.nt 

ATGAAAGTGAATAATTTTTTATCGTTCTTTTT^ 
TTGTATTATTCTTTATTGATTTTATTGGAATGTATA^ 
GTTAGGAGAAACTTCTCTGGTCTTTGATCATAATTC 
GAAGCTATTGATATTAAGAATCAGCAGATTGA 
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AGCTTGAATTTGAGCTTAAGCAAAA^ 

ATATAATGATGAGGAAGCAAATATTTTGCAAACAGCTGTA 

CGGCTTGAAGATTTAAATCCCGAGCTTGCAATATCTTATATG^ 

GTTTATCAATTGTTCCTTATTGGTTATCTC 

TAGTTCATTGGAGTAG 

t647.nt 

ATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGA^ 

CTCTGGTCTTTGATCATAATTCTAATATAATTCTTGAT^ 

TAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTO 

CTTAAGCAAAAGCAGAAAGATTTAGATTTAAAACA^^ 

AAGCAAATATTTTGCAAACAGCTGTATATTTA^ 

AAATCCCGAGCTTGCAATATCTTATATGCGGAAAAT^ 

CCTTATTGGTTATCTCTTATGGATTCTAA^ 

AG 

f653.aa 

MLTYGraiVTLLLVFFVPMFSLNDI I FQENVIRIMSASFTGAGPFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQ 

TAKNKSMIEFIEKIQSKNIVVRQEERGIVISLAADAFFDSA^^ 

TDNIDTDVNGPWKSNWELSAARSVNML^ 

TTDASLSFPKEIKQ 

t653.aa 

NDIIFQENVIRIMSASFTGAGFFKGGKTI^FSKIjSYLSNSFMSLPSTVRNKQ 

RQEERGIVT SLAADAFFDSASADVKLEENRDS IQKIASF IGFLS PRGYNFKIEGHTDNIDTDVNGPWKSNWELSAA 
RSVNMLEHILNYLIX2SDVKRIENNFEVSGFGGSRPIATDDT^^ 

f653.nt 

ATGTTGACTTATGGAGACATGGTTACTTTGCTGCTTGTC 

TTCAAGAAAATGTGATAAGAATAATGTCTGCTTC 

TTTTAGTAAATTATCTTATTTGAGTAATAG 

ACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAA 

GAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTT^ 

AGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTO 

ACAGATAATATTGATACTGATGTAAATGGACCTTGGAA 

TGCTGGAACATATTTTGAAOTATTTAGATCAATC 

TGGTGGAAGTAGGCCTATTGCAACAGACX^TACCCCTGAGGGTAGGGCI^ 
ACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

t653.nt 

AATGATATTATTTTTCAAGAAAATGTGATAAGAATAATGTCTGC 

GTAAAACTTTAGATTTTAGTAAATTATCI^ATTTGAG 

ACAAGCATCTCAGACTKXTTAAAAATAAATCCATGATTGAA 

AGGCAAGAAGAAAGAGGTATTGTAATATCTCTTGCAG^ 

TTGAAGAGAATAGAGATTCTATTCAAAAAATAGCATCTTO 

AATAGAAGGGCATACAGATAATATTCATACTGATGT^ 

AGATCTGTTAATATGCTGGAACATATTTTGAACT 

AAGTATCTGGTTTTGGTGGAAGTAGGCOTATTGCAAC^ 

TGATATATTAATTACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATO 



£664. aa 
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MRMSVYTMGFAYIRS IMGYWLFFFASLAVNFFVNI IQVGFFITFKSLEPRWDKI SFNFSRWAKNSFFS AGAFFNL 
FKSLLKVVIICLIYYFIIENNIGKISKLSEYTLQSGISrVLVIAY 

TKEEVKQERKEMEGDPLLRSRI KERMRVI LSTNLRVAI PQADVVITNPEHFAVAIKWDSETMLAPKVLAKGQDEIA 
LTIKKIARENNVPLMENKLLARALYANVKVNEEI PREYWEIVSKI LVRVYS ITKKFN 

t664.aa 

FVNIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNLFKSLLKV^ 
LQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEM 
NLRVAIPQADVVITNPEHFAVAIKWDSETMIJ\PKVLAKGQDEIAL^ 
EI PREYWEIVSKI LVRVYS ITKKFN 

f664.nt 

ATGCGTATGAGTGTTTATACTATGGGTTTTGCATATATTAGATCTATC 
CATCTTTAGCTGTTAATTTTTTTGTTAAT 

GTGGGATAAAATTAGTTTTAATTTTTCCAGATGGGCAAAAAAOT 
TTTAAAAGTTTGTTAAAAGTTGTTATAATAT 

CTAAGCTTTCGGAGTATACACTTCAATCTGGAATTTCTATTGTGTTA 

AGTAATGTTTTTGGCAATTGTAGGGGTGTTTGATTATTTG 

ACAAAAGAAGAGGTAAAGCAGGAAAGAAAGGAAATGGAAGGTGATC 

TGAGGGTTATTTTAAGTACCAATTTAAGAGTAGCTATTC 

TGCAGTTGCTATTAAATGGGATAGCGAAACAATGTTAGCTCCAA^ 

CTCACAATTAAAAAAATTGCAAGAGAAAATAATGTTC^ 

CTAATGTTAAGGTTAATGAAGAGATTCCAAGAGAATATTGGGAGATTC 

TATTACTAAAAAGTTTAATTAG 

t664.nt 

TTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTT 

ATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAG 

TGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATA 

CTTCAATCTGGAATTTCTATTGTGTTAGTGATTGC 

TAGGGGTGTTTGATTATTTGTTTCAAAGATC 

GGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAA 

AATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATC 

ATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGT^ 

AAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGC 

GAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAAT^ 

AG 

£680. aa 

MFTLSFVLINFIITGILII^LEFNFLKVDFKGNILL^^ 

SFLSLI P IVFGAI LLKHKEFYDIFMVLNFFEINLGALVAFWGI FS INFFFKMLNNKKL YYFS I YLFALSI I VCYF 
VRI 

t680.aa 

ITGILILMLEFNFLKVDFKGNILLAGIFMGLMQGLGALPGISRSGITIFSASVI 
ILLKHKEFYDIFMVLNFFEINLGALVAFWGIFS 



f680.nt 
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ATGTTTACATTGTCTTTCGTTTTAATTAATTTT 

TAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAGGAATTTT^ 

AGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGC^ 

TCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCG^ 

TTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAGT^ 

TAAAATGCTTAATAACAAAAAACTGTATTATTTTTC 

GTTAGAATATGA 

t680.nt 

ATAACAGGGATTTTAATCTTGATGCTAGAAT^ 

GAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTC 

ATCGGTTATTGGATTTAATAGAAAAAGTGCATTTC 

ATTTTATTAAAACATAAAGAATTTTATGATATTTTTAT^ 

TTGCTTTTGTTGTTGGTATTTTCTCAAT 

TATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTTG 

£688. aa 

MrVLLISIGCANAVHI INEIFKLIKKEQLS 

GVIISMIISLTVLPGIITLIPFAKKKSFEKEK^^ 

GLLKIEINFDEKDYFKESTSVKKTLNl^ 

INGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYI 

YAINTINEYMKNNKYHFSGVYDKVLIAKTM^ 

MRLFGITLNPATATIASVSMGVGVDYSIHFF 

SYKI I STLGAI IAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQZ 
t688 .aa 

YKTMGIFMSIGVIISMIISLTVLPGIITLIPFAKKKSFEKEKEN^^ 

VLIILGISFVGLLKIEINFDEKDYFKESTSVKKTL^ 

AFSAKTQSSSINGILKFTNFKIKKESPLEYKLPENKIILNK^ 

STEGIKKFEKYAINT INEYMKNNKYHFSGVYDKVLIAKTMVKEQVINI ITTLGS ITLLLMFFFKS IKTG 1 1 1 AI PV 
AWSVFLNF AVMRLFG I TLNPATAT I ASVSMGVGVDYS IHF FNTF I LQYQKNQI YKTALLES I PNVFNG I FANS I SV 
GIGFLTLTFSSYKI I STLGAI IAFTMLTTSLASLTLLPLLI YLFKPRVKLASNNNFKKLKQZ 

f688.nt 

ATGATTGTTTTACTTATTTCAATCGGATGCGCCAATGCTC 

AAGAAC^GCTOTCAAAAGAATCCATAAAAGCAACAATTAAAAAACTTAAAAC^ 

AACTGCATTTGGATTTTTATCTCTTACAACCTCTT^ 

GGAGTAATTATCTCAATGATAATCTCATTAACCGTT^ 

AGTCTTTTGAAAAAGAAAAAGAAAATAAACTAAATAAAATATC 

AATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATGGTCCTI^^ 

GGTCTTTTAAAAATCGAAATCAATTTTGATGA 

ACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGAAA 

TGCTAAAGCAATGCAAATATTAGACTTAATT^^ 

ATTAATGGCATTTTAAAATTTACAAATTT^ 

AAATTATACTAAACAAACTAATAAATTTGATAGATAAAAGCGATTGGAC 

CGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAAC^ 

TATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATCATTTCTC 

TAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAACTC 

TTTC^TTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTAGC^ 

ATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTGCATC 

ATTCAATTCATTTTTTCAATACATTTATTT^ 

AATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTC 
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TCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAA 
TTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGC 
ATAA 

t688.nt 

TACAAAACAATGGGTATTTTCATGTCAATTGGAGTAATT^ 
TAATAAGATTAATCCCATTTGCAAAAAAAAAGTCTTOT 

CCTTGAAAGACTTGCCAAACTAAATACGCAAATAACAAAATCTATATTAAAAAGA 

GTCCTCATCATACTGGGAATTTCTTTTGTAGGTCTTTO 

AAGAAAGCAC^GTGTAAAAAAAACATTAAACCTAATGCAAAAAGAAAT^ 

AATTGAAGGCAGGCCCGGTGAATTTAAAAATGCTAAAGCAATGCAAATAOT 

GCATTTTCTGCAAAAACTCAATCTAGTTCTATTAATGGCA 

(XCCACTAGAGTATAAACTGCCTGAAAATAAAATTATACTAAACAAACTAATAAATTTGATAGATAA 

GACTAAGGACAATAAAAGAATGTACATTAACGATGACTGGTCATTAATATCTATCATAGTAAG 

TCAACCGAAGGAATAAAAAAATTTGAAAAATATGCTATTAACACAATTAATG 

ATTTCTCAGGTGTTTATGATAAGGTATTAATAGCTAAAACAATGGTAAAAGAA 

TCTTGGATCAATAACACTACTACTTATGTTTTTCTTTAAATCTATAAAA^ 

GCATGGTCAGTGTTTTTAAACTTTGCTGTAATC 

CATCTGTAAGCATGGGAGTAGGAGTAGATTATTCAATTCATTTTTTC 

TCAAATCTACAAAACTGCACTTCTTGAATCAATACCCAATGTATTTAAT^ 

GGAATAGGATTTTTAACTCTAACATTTTCGTC 

TGCTAACGACATCTCTTGCATCACTAACTCTTCTTCCATTATO 

CTCAAACAACAATTTTAAAAAATTAAAACAATAA 

f704.aa 

MNYTKFQEFI SEFLGTFI LLAIiGTGSVAMTVLFS S SPEI PGEI IKGGYTNI VFGWGLGVTFGI YTAARMSGAHLNP 
AVSIGLASVGKFPVSKLLHYIVAQIIXSAFTGALMT^ 

TFLLMFLI SWGDFTKKHSDNPFI PFIVGAVVLSIGISFGGMNGYAINPARDLGPRILLLFAGFKNHGFNNLSIVI 
VPI IGP 1 1 GAI LGATI YEFTLKNNKD 

t704.aa 

GEI I KGGYTNI WGWGIX^VTFG I YTAARMSGAHLNPAVS I GLASVGKF PVSKLLHYIVAQ I LGAFTGALMTLWFY 

PKWIEMDPGLENTQGIMATFPAVPGFLPGFII^ 

GMNGYAINPARDLGPRILLLFAGFKNHGFNNLSIVIW 

f704.nt 

ATGAATTATACAAAATTCCAAGAATTTATATCGGAAT^ 

CTGTTGCAATGAGA^GTATTATTTTCCTCAAGTCCCGA 

AGTATTTGGATGGGGATTGGGTGTAACGTTTGGTA 

GCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTC 

TAGGAGCTTTTACAGGTGCATTAATGACACTTGTCX^^ 

AAATACTCAAGGAATAATGGCAACTTTCCCTGCTC 

ACTTTTTTGCTAATGTTTTTAATTTCTG 

TTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCG^ 
GGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATT^ 
GTACX^TAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTACA^ 
ACTAA 

t704.nt 

GGAGAAATAATAAAAGGAGGATATACAAATATAGTATTTGGA 

CAGCAAGAATGAGCGGAGCACACCTAAACCCAGCTGTTAGCATAGGATTAGCAAGTGT^ 
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AAAACTTTTACATTACATTCT 

(XTAAATGGATAGAAATGGATCCTGGCTTAGAAAATACTCAAGGAATAATGGCAAC 
TTTTGCCTGGATTTATTGATCAAATTTTT^ 

AAAAAAACACAGCGACAATCCATTTATTCCTTTTATTGTAGGAGCAG 

GGAATGAACGGTTATGCTATTAATCCTGCAAGGGATCTGGGACCAAGA^ 

ATCACGGATTTAACAATCTAAGTATAGTTATTGTACCAATAATTGGCCC 

AATTTACGAATTTACACTAAAAAATAACAAAG 

ACTAA 

f707.aa 

MRRLFLLYILCSFVFLNLFAQGSSSYIDKQKEIiAIFYYEVGQRYINVGKIK^^ 
VKELDARIKDDNPKVVMLEDIKLEEIPGI 

TOLFDSVIADKVNVMGQFESKNDFISTLSSASSKADADELEYLSVDDYTO 
KNFPFWKERQTLIFTTEDDNNWFLSSINZ 

t707.aa 

MRRLFLLYI LCSFVFLNLFAQGSS SYI DKQKELAI FYYEVGQRYINVGKI KKGKLFQAKALKI YPDLKKGFDIKLA 
VKELDARIKDDNPKVVMLE^IKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDK IKFQFGKFARALI SRN 
FDLFDSVIADKVNVMGQFESKNDFI STLSSASSKADADELEYLSVDDYYDLKSLKI SKSNDT S FAVNVNAKKNDVT 
KNFPFWKERQTLIFTTEDDNNWFLSSINZ 

f707.nt 

ATGAGAAGATTATTTCTTCTATATATTTTATGTTCT^ 

ATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATG^ 

AAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAA 

GTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATG^ 

TACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCC 

AGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAA^ 

TTTGATTTGTTTGATTCAGTTATTGCGGATAAAGT^ 

CAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGAT^ 

AAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTT^ 

AAAAATTTTCCATTTTGGAAAGAACXST^ 

CCATAAATTGA 

t707.nt 

CAAGGTAGTTCTTCTTATATTGATAAGCAAAAAG 

ACGTTGGTAAAATTAAAAAAGGAAAGCTTTTTCAAGCAA 

TGATATGAAGCTTGCAGTTAAAGAGCTTGATC 

ATTAAGCTTGAGGAGATACCTGGAATAGTGCACGAAAAAATAGAAATCAAT^ 

AATATATTGCTCAAAGAGAGAGAAGCAAAAATCAAGATAAA 

TTTAATTTCTAGGAACTTTGATTTGTTTGA 

AAAAATGATTTTATATCAACTTTATCAAGTGCTTC^ 

ATGATTATTACGATTTAAAGTCTTTAAAAATTTC 

AAAAAATGATGTTACTAAAAATTTTCCATTTTGGAAAGAACGTC 

AATTGGTTTTTGTCTTCCATAAATTGA 

f709.aa 

MLIFGFIGLFFLNIFSLHAQGIVTNKDAQEEFKWAI^SYNNGIYD 

YVEEALMEWRNLKDQG YKVPYLRHL I S T I EQRRG I F SNYELNFKKLVKVASLDNS I YKRPHGYQ I TSLRADKYGGY 

YAANFVGNEILYFDVNNNVNALVKDGFSYLKSPYDVIEANNLLYVTLYSSDEI 

LIAPQYMAIDIQINYIWSEWGNKRVSKFGLEGDFILHFGSRTSGYKGLI^ 
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SGNHLYSVFTSIEGIEGLSSDFVGNNVI^ 

AKVSVYKSDASLYDSLNVDVRRI IRLGGPKI YVELNVS SKSGLPWGLKSENFS I SNENYY I VNPKVAYNVNASKD 

INIAVVFDKSSYMKKYDTDQIVGLNALMELSKNra 

LlKLAGSGLMSKSSRRAVVYFSGGILNRKAFEKYSLOT 

FSSYEGVSKVYDLI LEQKTGTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYF IN 
t709.aa 

QGIVTNKDAQEEFKWAIiNSYNNGIYDDALLSFKKILSFDPNNLDYHFWTG 

PYLRHLI STI EQRRGI FSNYELNFKKLVKVASLDNS I YKRPHGYQI TSLRADKYGGYYAANFVGNEI LYFDVNNNV 
NALVKIX^FSYLKSPYDVIEANNLLYVTLYSSDEI^ 

WGNKRVSKFGLEGDFILHFGSRTSGYKGLl^PTGVTYLNENIYVADSLRNTIEVFOT 

SDFVGNNVIVSSKDGVYKYS IAKKTITKILKADKMNSKISSS I LDANNQMIVSDFNNAKVSVYKSDASLYDSIjNVD 

VRRIIRLGGPKIYVELWSSKSGLPWGLiKSENFSISNENYYIVNPW 

QIVGLNALMELSKNKNFSFINATSVPIIDNIESLTO 

FSGGILNRKAFEKYSLDTIVSYYKNNDIRFYLILFG^ 

GTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYFIN 

f709.nt 

ATGTTAATTTTTGGTTTTATTGGTTO 

AAGATGCTCAAGAAGAGTTTAAATGGGCTCTTAATTCTTATAATAATG 
TAAAAAAATTTTAAGCTTTGATCCTAATA^ 

TATGTTGAAGAAGCriTAATGGAATGGAGAAATTTAAAAGATCAAGGCT 

TTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAAT^ 

TCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCA 

TACGCTGCTAACTTTGTAGGCAATGAAATATTGTATTTTC 

GCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTC 

TGAAATTGGTGTTTATGACAAAGTTCTTGGAGOT 

TTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAA^ 

GTAAATTTGGACTTGAAGGTGATTTTATTTTGCATT^ 

CACAGKX?GTTACTTATTTGAATGAAAACAT 

AGTGGTAATCATTTATATTCAGTTTTTACTTC 

ATGTTATAGTATCCTCAAAAGATGGTGTTTATAAATATAGCATTGCT 

AGATAAAATGAATTCTAAAATTTCTTCATCT^ 

GCCAAGGTTTGAGTTTACAAGAGTGATGGAAC^ 

TTGGAGGGCCTAAAATTTACGTTGAGCTTAATGTTAGCAGT 

AAATTTTTCAATTTCAAATGAAAATTATTA 

ATTAATATAGCAGTTGTTTTTGATAAATCTTCTTATATGAA 

CXCTAATGGAGTTGTCAAAAAATAAAAACTTTAGTTTTATAAATG^ 

AAGCTTAACAAATAGCATTAGAAATACAAGTO 

TTGAAGTTGGCAGGTTCTGGGCTTATGTCAAAAAGCT^ 

ATCX3TAAAGCTTTTGAAAAGTACTCTTTAGATACAATAGTAAGC 

AATACTATTTGGTAATGATCCTATTAATAGTAA 

TTTTCATCTTATGAAGGTGTATCTAAAGTTTATGATT^ 

ATTATTATCCAGGCCCTCAAGAACCTAATAAATATTTTAATTTATCTC 

AAGAGGGGAGTTTGCATATTTTATTAATTAG 

t709.nt 

CAAGGAATAGTTACTAATAAAGATGCTCAAGAAGAGTTTAAATGGGCT<^ 

ATGATGCTCTTTTATCTTTTAAAAAAATTTTAAGCT^ 

TGTTTATTATAGACTGGGTTATGTTGAAGAAGCTTTAATGGAA 

CCCTATCTTAGACATTTGATTTCTACTATTGAGCAAAGGAGAGGTATT^ 

AACTTGTAAAAGTTGCTTCTCTTGATAATTCTATTC 

TGATAAGTACGGCGGATATTACGCTGCTAACTTTGTAGGCAATGAAATATTO 



WO 98/59071 



PCT/US98/12718 



108 

TABLE 1. Nucleotide and Amino Acid Sequences 

AATGCTTTAGTTAAAGATGGCTT^ 

TGACTCTTTATTCAAGTGATGAAATTGGTGTTTATGACAAAGT^ 

AGGCACAAAAGATGGCGAATTGCTTGCTCCTCAGTATATGGCTATTC 

TGGGGAAATAAAAGAGTAAGTAAATTTGGACTTGAAGGTGATTTTAT^ 

ATAAGGGCCTTTTAGGTCCCACAGGCX3TTACTTATTTGAATGAAAA 

CATTGAAGTTTTTGATACTAGTGGTAATCATTT^ 

AGTGATTTTGTAGGTAATAATGTTATAGTATCCTC 

TTACAAAAATTTTAAAAGCAGATAAAATGAATTCTAAAATTTC 

TGTCTCAGATTTTAATAATGCCAAGGTTTCAGTTO 

GTTAGAAGAATAATTAGGCTTGGAGGGCCTAAAATTTAC 

TTGTTGGGCTTAAAAGTGAAAATTTTTCAATTTC 

TGTAAATGCTTCAAAAGACATTAATATAGCAGTTGTTTTTGATAAAT^ 

CAAATTGTAGGGTTAAATGCCCTAATGGAGTTGTCAAAAAATAAAAACTTO 

CCATTATAGATAATATTGAAAGCTTAACAAATAGCATTAGAAATACAAG 

TGTAAAAACAGACGTTAGTTTGAAGTTGGCAGGTTCTGGGCTTATC 

TTTAGTGGTGGTATTTTAAATCGTAAAGCTTTTGAAAAGT^ 

ATGATATAAGGTTTTACTTAATACTATTTGGTAATGATCCTATTAATAGTAAGC 

AGGCGGTGCTGTAATTCCTTTTTCATCTTATGAAGGTGTATC 

GGCACTTATTTGTTGGAATATTATTATCCAGGCCCTCAAGAACCrAATAAA 

ATATAAATCAACAGACAGGAAGAGGGGAGTTTGCATATTTTATTAATTAG 

f730.aa 

MIKSILDYLLTLHPVLLGLLGSTFTWFTTAFGAAAW 
LGYITWVPAWGFLVGAFFIYIVDVFVPDLDKLTFIDEDM 
DIQTLVGAMLLTLGIGIQNIPEGAAISLPU^^ 
AGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGFTLMMFLDVSriGZ 

t730.aa 

AWFFRKVDNKIMDAMIX3FSAGIMIAASFFSLIQPAIERAEEI/3YITWVPAWGFLVGAFFIY 
FIDEDLTKHGKKDFLLFTAVTLHNFPEGIAV^ PEGAAI SLPLRRGN 

VALAKC FNYGQMSGLVE IVGGLMGAYAVYS FTRI LPFALAF SAGAMI YVS I EQLI PEAKRKDI DNKVP S I FGVIGF 
TLMMFLDVSLGZ 

f730.nt 

ATGATAAAATCAATTTTAGATTATTTATTAAC 

GGTTTACTACAGCTTTTGGAGCAGCAGCAGTTTTTTTC 

TGGTTTTTCAGCTGGCATTATGATAGCGGCCAGTTTT^ 

CTTGGATACATTACTTGGGTGCCX3GCTGTTTTTC 

TTGTTC(^GATCTGGATAAACTTACTTTTAT^ 

TACTGCTGTTACTTTACATAATTTTCCAGAAGGATTGGCTGTT^ 

GATATTCAAACTTTAGTTGGGGCTATGCTTC^^ 

TTTCTCTGCCTTTAAGAAGAGGTAATGTTGCTTTGGCA 

AATTGTGGGGGGGCTTATGGGTGCTTATGCGGTTTATTCTTO 

GCAGGAGCTATGATTTATGTGTCAATTGAACAATTAATACCTGAA 

CAAGTATATTTGGTGTTATTGGTTTTACATTAATGATC 

t730.nt 

GCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATG^ 

CGGCCAGTTTTTTTTCGCTTATTCAGCCTGCT 

TGTTTTTGGATTTCTTGTTGGGGCAW 

TTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTCT 
CAGAAGGATTGGCTGTTGGAGTTGCTTTTGG 
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GCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGC 

GTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATC 

ATGCGGTTTATTCTTTTACTCGAATTTTACCTTT^ 

TGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGAC^ 

ACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 



fl97.aa 

MLLKLKYRFVGFLLLFLIFILLLFSTIFNFVLCGYLEDY^ 
TISEFAMSENRGKDFSESELIDLRKNPKFVTDSVKVSKKYRQYLYN^ 
NNMDFGHSEANTNYFKKAVEDYRQNQLKFIGWYSNLSTC^ 
YLLOTIVADSFT)RFRFGFYKRGNFIYVDPNNIAVNPFEEYNETSRV 
DRILLS EMGEIXTYYAMLPISSKLGEKSGVLIARLPYKDIYGVISSI^ 

IJ^IRVLVQDMVKGNIJ)KDYAIjDDDENTIJDELGML S LQWKMKKAI SVAI SSVLRNI SYVNKASLEVAS S SQNLS S 

salqqasaleemsanveqiasgvnmsannsyeteqialktnen 
llalnaazeaaragdegkgfawaseirkladlskisaleige 

segs skqsdqiaqfkmaldqvgewqssassseqls smsdkmlekskelrksvlffkikdski enpenddydfrli 
dcpensfkdenqnlksngistsnasghnnysi^ 

tl97.aa 

vlcgyledyykqltraqvrraafsi^sfldtlhviingaasnlalet^ 

idsvkvskkyrqylynfmanlkndtlfeefaffdfeg 

gwysnlsegisaevairskqsekkafaixwwspedklvcgyiagylln^ 

niavnpfeeynetsrvsskflnvlkdvfskppfpsniasev^ 

iarlpykdiygvisslrfqyilysvlgiialsivlsiridriisfruiair^ 

ELGMLSI^VVKMKKAISVAISSVIiRNISYVNKASL^ 
YETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEI 

ADLSKI SALEIGELVEDNSKVATEAGVTFKEMLPEIEETANLVKKI SEGSSKQSDQIAQFKMALDQVGEVVQSSAS 

SSEQLSSMSDKMLEKSKELJIKSVLFFKIKDSKIENPEmDYDFRLIIXrPENSFKDENQNLK 

SLDIESESSVRTINKRVDPKKAIDIADKDLNFDDDFSEF 

fl97.nt 

ATGTTATTGAAGCTTAAATACAGGTTTGTT^ 

CGATTTTTAATTTTGTTTTATGC^ 

AGCTTTTTCTTTGCAATCTTTTTTAGACACCCTG 

accatatcagaatttgcaatgtctgagaatag^ 

atccaaaatttgttattgactctgtaaaggtgagc^ 

taaaaatgataccctttttgaagaattcxk:ttttt^ 

AATAATATGGATTTTGGTCATTCTGAGGCTAATACCAA^ 

AATTAAAATTTATAGGTTGGTATTCAAATCTTTCTGAAO 

CGAAAAAAAGGCTTTTGCAATAATTGTACCTGTATATTCCCCAGAAGATA 

TATTTGCTTAATGATATTGTGGCAGATAGTTTTGATA 

ATGTGGATCCCAACAATATAGCTVGTTAATCCTT^ 

GAATGTTCTTAAAGATGTTTTCTCTAAGCCCC 

GATAGAATACTTTTGTCCGAAATGGGAGAAGATTC^ 

AGAGTGGAGTAOTTATTGCTAGGCTTCCTTATAAGGATATTTACGGACT 

TTTATATTCAGTCTTAGGCATTATAGCATTAAGTATTGTTOT 

TTAAACGCAATTAGAGTTCTAGTTCAAGATATGGTTAAGGGCAA 

AAAATACTCTTGATGAACTTGGCATGTTAAGTCTTGA^ 

TAGTGTTTTGAGAAATATTAGCTATGTAAATAAGGCAAGTTTAGAAGT^ 

AGTGCATTGCAACAGGCATCTGCTCTTG 

GCGCCAATAATTCTTATGAAACAGAACAAATAGCTTTAAAGACGAATGAA 
TGAAGAATCTGTTATTGCTATGCAAGACATTC 

TTACTTGCTTTGAATGCGGCTATTGAAGCTGCAAGAGCAGGAGATGA 
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AGATTAGAAAGTTGGCTGATTTGAGTAAA 

AGCAACTGAAGCGGGAGTGATCTTTAAAGAAATGCTACCCGAAA 
TC^GAAGGTAGCTCTAAGCAAAGCGATCAGATTGC^ 

AATCTTCAGCTTCAAGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTT 

ATCTGTATTATTTTTCAAAATTAAAGATTCTAAAATTGAA 

GATTGTCC^rcAAAATTCTTTTAAAGATGAAAATCA 

GGCATAATAATTATTCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTAT^ 
AGCTATCGATATTGCTGATAAGGATTTAAATTTTGATGATGA 

tl97.nt 

GTTTTATGCGGTTATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGT 

AATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTC 

TGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAAT^ 

ATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAAT^ 

TTTTTGAAGAATTCGCTTTTTTTGATTTTC 

TGGTCATTCTGAGGCTAATACCAATTATTTTAA 

GGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCA 

TTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTC 

TATTGTGGCAGATAGTTTTGATAGATTTAGATTC 

AATATAGCAGTTAATCCTTTTGAAGAATATAATG^ 

ATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTC 

GTCCX5AAATGGGAGAAGATTGTTATTATGCAATGTTGCCCA 

ATTGCTAGGCTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATT^ 

TAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATO 

AGTTCTAGTTCAAGATATGGTTAAGGGCAATTTA 

GAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATC 

ATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTO 

GGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGGAAATAGCCTCAG 

TATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGT^ 

TTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGC 

TGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTO 

GCTGATTTGAGTAAAATTTCTGCTCCT 

GAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAA 

TAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATC^ 

AGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGT^ 

TCAAAATTAAAGATTCTAAAATTGAAAATCGAGAAAATGATGATO 

TTCTTTTAAAGATGAAAATCAAAATTTGAAAAG 

TCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAA^ 

CTGATAAGGATTTAAATTTTGATGATGATTTTTC 

f200.aa 

MTI SKNVFSKF I LKFLNS SAFVSVFALFVGFL IVGLWMGLGHS PFRMYFI I LEI I FS S PKHLGYVLSYSAPLIFT 

GI^IGISLKAGLFNIGVEGQFIIiGSIVALIASVLLDLPPILHVITIFIITFIiASGSI^ 

;mFNWILFHLNNIIIJ)FSFIKRDNSD^ 

FAI LIWFLLNKTI IGFKINATGSNIEASRCMG IN^KAVLI FSMFLSAAVAGLAGAIQLMGVNKAIFKLSYMQGIGF 

NG3AASLMGNNSPIGIIFSSILFSILLYGSS 

ILD 



t200.aa 

LVVMGIX3HSPFRMYFIILEIIFSSPKHIX3YVLSYSAPLIFTGLSIGISLKAGLFNIGVEGQ 

DLPPILHVITIFIITFIASGSLGILIGYL^^ 

YIDFLASWKLSPEGIAYRSSHPFVNELLKAPI 
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KAVLIFSMFLSAAVAGI^GAIQLMGVNKAIFl^ 
LMGLPSSIVSI^GIIVLVISASYFLNKIVLKGVKRVKYIONILD 

f200.nt 

ATGACAATTAGTAAAAACGTATTTAGTAAATTTATTTTGAAATTTTT 

CTCTATTTGTTGGATTTTTAATTGTTGGGCTAGTG^ 

AATATTAGAAATTATTTTTTCTTCTCCCAAACATTTAGGTT 

GGTCTTTCTATTGGTATTTCTTTAAAAGCGGGTCTTTOT 

TTGTTGCTTTAATAGCATCAGTTTTACTTGAT^ 

TTTAGCTTCAGGCAGTTTAGGAATTTTAATC 

ATAATGTTTAATTGGATATTATTTC^TTTAAATAA 

ATTTTTCAAAACCCATTAAAGAAAGCGCATATATT^ 

TTATAGATCTTCTCATCCTTTTGTTAATGAGCTTTTAAAAGCACCTC 

TTTGCTATTTTAATATGGTTTTTACTTAATAAAACTATTATTC 

AAGCTTCAAGATGTATGGGTATTAATGTAAAAGCTGTGCT 

TCTTGCTGGTGCTATTCAACTTATGGGTGTTAATAAAGCTA 

AATGGGATAGCTGCTTCTCTTATGGGAAACAATTCGCCAATTGGCATAATA 

TGCTTTATGGAAGGAGTAGAGTTCAAAGTTTAATGGGCCTTCC^ 

TGTTCTTGTAATTTCTGCTAGCTATTTTTTAAATAAAATTC 

ATTCTTGATTAG 

t200.nt 

GGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTAT 
CCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTC 
AGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTATTO 
CTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTT 

TAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGAATA^ 
TTTAAATAATATAATTTTAGATTTTAGTT 
GCATATATTGATTTTTTAGCTTCTTGGAAGCTC 
ATGAGCTTTTAAAAGCACCTCTTCATTTTGGAAT 

TAATAAAACTATTATTGGATTTAAAATAAATGCCAGAGGAAGTAATAT^ 

GTAAAAGCTGTGCTAATTTTTTCAATGTTTC 

GTGTTAATAAAGCTATATTTAAGCTTTCTTA^ 

AAACAATTCGCCAATTGGCATAATATTTTOTAGGATTC 

AGTTTAATGGGCCTTCCATCTTCAATTGT^ 

TTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATAT^ 
£208. aa 

MVKKFSIFLKAIIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFOT^ 
FDIIHCLIPIiAFYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLIiRFN^ 

LIPFTFFIIISSSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYR^ 

EMEKTKFYIDKYLQRKSDSILGIFLFTLFASFTIFL*^^ 

KVYELAKSFNNLLLKEKLNSKRKSKI PLEI EKVKKI INKNQEIK 

t208.aa 

IIIFSIFELLIEELSIILFLPYKIRFALIFMFLFiyriFIFIFLYKITK^ 
FYSSYQLKNIIVAHETILtTCIMLSLFKI^ 

SSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVIEMEKTKFYID 

YLQRKSDSILGIFLFTLFASFTIFLMNFYKFFKASF^^ 

LLLKEKLNSKRKSK I PLEI EKVKKI INKNQEI K 



f208.nt 



WO 98/59071 PCT/US98/12718 

112 

TABLE L Nucleotide and Amino Acid Sequences 



ATGGTAAAAAAATTTTCAATTTTCTTAAAAGCAA 

CAATAATTCTTTTTTTACCATACAAAAT^ 

TTTCATTTTTTTATACAAAATAACC^ 

TTCGATATAATCCACTGCCTTATTCCTTTAGCGTTTTATAGCTCATA 

AAACAATATTAAATCCAATAATGCTATCACTTTTCAAGTTAAGATTTO^ 

AATAGAAATATATTACAATTCAAAAGAAAAGAACCTAATA 

TTAATACCATTTACATTTTTTATAATAATATCAAGCTCAAAAATTGT 

ATATCATTAAAAATATATCAATAATAAATGAAAAAGCTTACATTAAAGAAAA 

GGAAAAAGATGACATAATATACTGAAAATCAGACGAAATATTTGTTTACTACAGTC 

GAAATGGAGAAAACAAAATTTTATATAGATAAATA^ 

TTACATTGTTTGCATCATTTACTATTTTT^ 

TATTTTAATGACAAAAATTTTACAAGACCCATTAGAATATCGAAA 

AAAGTATATGAACTTGCAAAATCATTTAACAATCTCTTGCTAAAAGAAAAACTAAA 

TACCTTTAGAAATTGAAAAAGTAAAAAAAATAATTA^ 

t208.nt 

ATAATAATTTTTTCAATATTTGAACTTTTAATC 
TTGCACTAATATTTCTTGGGTTTCTATTTGACACAATT^ 

CCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTCTTCGATATAATCCA 

TTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATGAAAC 

TCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAATAAT 

CCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGC^ 

AGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAG 

AAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAAGGAAAAAGAT^ 
CGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAG 
TATTTGGAAAGAAAAAGCGATTCTATTCTTGGAATTTT^ 
TGAATTTTTATAAATTTTTTAAAGCAAGCTT 

AGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAAAAAGTATATC 

CTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAAGCAAAAT^ 

TTAATAAAAACCAGGAAATAAAATGA 



f210.aa 

MKIQIIIMLLALLDFPLNARLLDISI^ 

TNYSLLNSNYKEANKnjIQSELIDKKFLKYKIFKIKNINGIFKSHSLIYTK^ 

YFLKNIjDKISNEMIFFPREKREVNMIQKTTIAADSSSKPRGINYDTGIPF 

IDTAAIXJEEAVIKYKNHYPNIDIWIjDIT^^ 

KPLDRAKVLQRVMSVFVK 



t210.aa 

RLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEIEKNIYKLTEHFVKS^ 

SELIDKKFLKYKI FKI KNINGI FKSHSLI YTKKGFYKLELYI ENNAEPLK I FNLNITYFLKNLDKI SNEMI FFPRE 
KREVNMIQKTTIAADSSSKPRGINYDTGI PFNVLIYDDSVFTVKQLTQIFTSEGFNI IDTAADGEEAVTKYKNHYP 
NIDIVTLDITMPKMIXSITCLSNIMEFDKNARVI^ 

£210. nt 

ATGAAAATTCAAATAATTATAATGCTGCTTGCA 

TTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCOTATAATT^ 
TTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACT 
ACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCAAATAAATATCTAA 
AATTTTTAAAATATAAAATATTTAAAATCAAAAATATAAA 
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AAAAGGATTTTACAAATTAGAA^ 

TATTTTTTAAAGAATTTAGATAAAATAAGTAATGA^ 
t210.nt 

AGACTTTTGGACATTTCAATTGAAAAAAGAGCAGAT^ 

AAAAAGAATACTATACCAATTTTCCAACAAGCGAAATAGAAAA7UVATATTTATAAACTAACA 

AAGCATAATGCTCAATAAAACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCA^ 

AGCGAACTCATTGATAAAAAATTTTTAAAATATAAAAT^ 

ATTCACTAATATATACAAAAAAAGGATTTTACAAATTAGAACTTTACATAGAA 

ATTTAACCTTAACATTACTTATTTTTTAA 

TGA 

£22 . aa 

MLKTLTKIITISCLIVGCASLPYTPPKQN^ 

KKENNDFALLIMGNF PKDI FWGIHKNRNTES IGNI FTNPKWKLKNSNI YI I PNKARTS IAITQKDITAKDNNMLTT 

KYIGEIEKNEMFFWIQDPTLLLPNQrVSSKl^I^^ 

TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22.aa 

PYTPPKQNIjNYIiMELLPGANLYAHVNLIKNRS I YNSI^PKYKSVI/3LI SNLYFSYKKENNDFALLIMGNFPKDI FW 
GIHKNRNTES IGNI FTNPKWKLKNSNIYI I PNKARTS I AITQKDITAKDNN^ 
LPNQIVSSKNLIPFSSGTLSINSI^QEEYIFKSLIKTNNPPILKIL^ 
VEIEFNIQKSSVESLIEKLASNIQT 

f 22. nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTC 

CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGG^ 

CAGGTCTATTTATAACTCTTTAAGCCCT 

AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTC 

ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAA 

TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAA 

AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTT^ 

TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGG^ 

TTTTAAATCCTTAATCAAAAGAAATAATC 

ACAAACATGACAAACCTCACAATATCAAGC(^CA 

TTAATATTCAAAAATCTAGTGTTGAAAGCCTTAT^ 

t22.nt 

CXTTTACACTCCTCCAAAACAAAATCTAAATTACT^ 

ATTTAATTAAAAACAGGTCTATTTATAACTC 

ATACTTTAGCTATAAAAAAGAAAATAACGATTTTGCTCTAOT 

GGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATT^ 

ATATATACATTATTCCAAAGAAAGCTAGAACT^^ 

TATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGT^ 

CTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTT 

AAGAAGAATATATTTTTAAATCCTTAATCAAAA 

TCCAACCGTCTTGACAAACATGACAAACCTCACAAT^ 

GTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGC 

AA 



f221.aa 
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MG I TVFYLF S I FAS FVLGS SMDSVKENVLKST I FYYDVEEVEF PYARKQTLQF IAKTHLKYAVFNFDKNKMF S YT F 
VFDKKL I SQ YAI F I EVKKKFGEATLVT PLNYLWDLGDS I IVLNKNILRITLKSYISNYNK 

t221.aa 

SMDSVKENVLKST I FYYDVEEVEF PYARKQTLQF I AKTHLKYAVFNFDKNKMF S YTFWDKKLI SQYAI F I EVKKK 
FGEATLVTPLNYLWDLGDSI I VLNKNI LRITLKSYI SNYNK 

f221.nt 

ATGGGTATTACAGTTTTTTATTTATTTTCTATTTO 

AGAATGTTCTCAAGAGCAOTATTTTTTATTATGATC 

ACAATTTATTGCTAAAACCCATTTAAAATATGCTC 

GTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTT 

TAACGCCTTTGAATTATTTATGGGATCTTGGTGATTC 

AAAATCTTATATTTCAAATTATAATAAATGA 

t221.nt 

AGCATGGATTCTGTTAAAGAGAATGTTCTCAAGAGCACTAT^ 

ATGCTAGGAAGCAGACTTTACAATTTATTGCTAAAACCCATTTAAAATAT 

AATGTTTTCGTACACTTTTGTTTTTGATAAAAAAT^ 

TTTGGCGAGGCTACACTAGTAACGCCTTTGAATTATTTATGGGATC 

ATATTTTAAGAATTACTTTAAAATCTTATATTT 

f253.aa 

MYMENIEVRGQPNFFGLI PFFVFI I IYLGTGI YLGVIGVEMAFYQLPASVAMFFASIVCFLVFKGKFSDKIHIFIK 
GAAQYDIILMCLIFMLSGAFSSLCKEIGCV^IVANLGIKYINPN^ 
NIAVKSGINPNLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDW 
ATNFLHESSIDLVKTVPYI^IIFFSIJVGMNWIVLFLGILSICLISVLYGNL 

LTGGVSF AVI HNGGFKWLL I KLKS LI RGKS S AEF S IGAFVS rVDVFLANNT I AI L I CGKVAKKI AF ENN I SVQRS A 
S ILDMFSCIFQGI I PYGAQMI I LVNFSNGLVSPI S ILPFLVYFGFLLFFVILS ILGLDIKKVFLFFLKK 

t253.aa 

LWKGKPSDKIHIFIKGAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLG 

AGTSVGS IVAI API AFOT AVKSGINPNLI AASVMCGAMFGDNLSL ISDTT I VS SRTQGS S I LDVFI S S S FYAF PSA 

ILTFPSFFFLSENLSNATNFLHESSIDLVKTNHPYI^ 

INKGFLNMADLIFLSILTGGVSFAVIHNGGFKWLLIKLKS 

AKKIAFENNISVQRSASILDMFSCIFQGIIPYGAQMIILVNFSN^^ 

KVFLFFLKK 

f253.nt 

ATGTATATGGAAAATATTGAAGTAAGAGGGCAGCCAAATTTT^ 

tctatttaggcacggggatttatttg^ 

aatgttttttgcttccattgtttgtttt™ 

ggagcagctcagtacgatattatactaatgtgtcttatttttat^ 

AAATAGGCTGCGCTGAAACTGTAGCAAATTTGGGAATTAAATATATTAATC 

AATATTGCTCTTAAAAGCGGCATTAATC^ 
ATCTTTCTTTAATATCAGATACAACTATTGTTTCTAGTC 
TAGCAGTTTTTATGCTTTTCCATCCGXXATACTAAC 
GCCACAAACTTTTTACACGAAAGTTCAATAGATTTAGTG 
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TAGCTGGAATGAATGTTTTTA 

TTTATACTTTCTAGATGTAATGAAAAACATTAATAAAGGGTTTT^ 

TTAACAGGGGGAGTTTCTTTTGCCGTGAT^ 

TTAGAGGAAAAAGTTCAGCGGAATTTTCTATTGGGGCTTTTC 

AATTGCCATACTTATTTGCGGGAAAGTAGCA 

TCTATTTTAGATATGTTCTCTTGTATTTTTCAA 

TTTCAAATGGACTTGTGTCGCCAArTAGTATTTTGC^ 

TTTATCTATTTTGGGCCTTGATATAAAAAA^ 

t253.nt 

TTGK5TATTTAAAGGAAAATTTTCCGACAAA^ 
TGTGTCTTATTTTTATGCTTTC 

TTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTGGTAT^ 

GCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTA 

CGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATAA 

TGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTTTTATT 

ATACTAACTTTTTTTTCTTTTTTC 

TAGATTTAGTGAAAACTGTGCCTTATTTAATC 

TTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTC 

ATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTT^ 

TTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTG^ 

TATTGGGGCTTTTGTTTCAATAGTTGATC 

GCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCTTCTA 
TTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTA 
TATTTTGCCATTTTTAGTTTATTTTGGATTTO 
AAAGTTTTTTTATTTTTTTTAAAAAAATAA 

f265.aa 

MRKCFVSLSLLLI FFAC SSNVEIELNDDISGI VS I FVNVNREFEKIRKELLTTLVGEE IANMPLF PVDE I KKYFKN 

GEEKLGLKLLSIKTQGDSINLVVKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINL 

DALAALLPSDEIPMSAKEYKDW^VYFLSDFTSK^ 

LSLETPIKLRLV 

Y 

t265.aa 

SNVE IELNDDI SGIVS IFVNVNREFEKI RKELLTTLVGEEIANMPLF PVDEI KKYFKNGEEKL/3LKLLS IKTQGDS 
INLWKFDNLIKILGDYMKKPDISVFKIEKKDGKNI IELNINLENATKNINENKEYISDALATVLLPSDEI PMSAKE 
YKDVLVYFLSDFTSKASELIDNSKLNLV^ 

f265.nt 

ATGAGAAAGTGTTTTGTTAGCTTGAGTTTATTGTTC 

ATGATGATATTAGTGGTATTGTTTCAATATTTGTTAA 

AACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTC 

GGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAG^ 

ATAATTTAATTAAAATTTTAGGCGATTATATGAAAAAACCCGATATATCTGTO 

TAAAAATATTATTGAACTTAATATTAATTTGGA 

GATGCACTTGCTGCTCTTTTGCC^TCGGATGAGATC 

TTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAA 

AAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACAC^ 

TTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 



t265.nt 



WO 98/59071 PCT/US98/12718 

116 

TABLE 1. Nucleotide and Amino Acid Sequences 



TCTAATGTTGAAATTGAGTTAAATG 

AAAAAATTAGAAAAGAACTCTTAACAAC 

AATAAAAAAATACTTTAAAAATGGAGAGGAAAAGCTTGGGC 

ATTAATTTAGTTGTTAAGTTTGATAATTTAATTAAAATTTO 

TTAAGATAGAAAAAAAAGATGGTAAAAATATTATTGAACTTAATATTAATTTGG 

TGAAAATAAAGAATATATTAGTGATGCACTTGCTGCTC 

TATAAAGATGTTTTGGTTTATTTTTTATCGGATTTTAC 

ATCTTGTAGTTAAGACTTCTAGAAATGTTCAAGAACAATTTGGAT^ 

TGAGATGGATATGGTTAAAGGATTAAGTCTTGAAACACCAATAAAACTTAGA 

f269.aa 

MNIRKLLFCIFFMNISFLLFAGDYKGLDFKIKFFNQSIYRVNSNVFIEVSLSNASESVLTLEIGDINSFGFDFDVT 

OTTNIKVKRPIE^VKKRSKNVAIPVIU^ 
TLFLNIX3FDENPGSIDLVNLSENNDIQDILKKKKLS 

KQKLSPIPNKNVVEEYKEYLWNSNNSDISKAPNKFSIIETTYSDTSGKVIADL 
WIIYDYIVQNTGIKEK 

t269.aa 

GDYKGUJFKIKFFNQSIYRWSNVFIEVSLSNASES^^ 
AIPVRmSIJlPNEKFSVVINIiNQFVKFSKIX^rm^GIFFPDISDPS 
ENNDIQDI LKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDI 

NSNNSDI SKAPNKFSI IETTYSDTSGKVI ADLYFDDGQFYI SKRYTFFFKKYDYYWI IYDYIVQNTGIKEK 
f269.nt 

ATGAATATTAGAAAATTGCTTTTTTGTATCTTTTTTA 

GCCTTGATTTTAAAATCAAGTTTTTTAATCAATC 

TAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAA 

GATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATC 

GAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTA 

TGGAGTTTATTTTGTTAAGGGTATTTTTTTC 

ACGCTTTTTTTGAATGATGGTTTTCATGAAAATC 

TTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAA 

GAAAAAAGAAAAGTTCTTTTTATATCTTGATATT^^ 

AAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTG 

CGGATATTTCAAAAGCACCAAATAAATTTTC 

TGATTTATATTTTGACGATGGGCAATTTTATATTTC 

TGGATAATTTATGATTACATTGTTCAAAATACrreGC^ 

t269.nt 

GGAGATTACAAGGGCCTTGATTTTAAAATCAAGTTT^ 

TTGAAGTTTCTCTTAGTAATGCGTCTGAGAGTC 

TTTTGATGTTACTGATACCACCAATATTAAAGTTAAAAGACCTATT^ 

GCAATTCCTGTTAGAAATATGAGCTTGAGACCTAATGAAAAATTTTC 

AGTTTAGTAAAGATGGAGTTTATTTTGTTAAGGGTATTTT^ 

ATTCAATATTATTACGCTTTTTTTGAATGATGGTT^ 

GAAAATAATGATATTCAAGATATCTTGAAAAAGAAAAAATTATC^ 

CATTGCAGCTTGGGAAAAAAGAAAAGTTCTTTTTATATC 

GGCATACCTTTATAAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTC 

AATTCTAATAATTCGGATATTTCAAAAGCACCAAATAAATTTTC 

GCAAGGTGATTGCTGATTTATATTTTGACGATGGGCA^ 

ATATGATTATTATTGGATAATTTATGATTACATTG 
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f29.aa 

MNWLSFFYVLLFLLIFPFEI^SNNKENIENLIK^ 
KEANDFLKKINQKKIKNQKIK^IISLKLRINEDNIN^ 
1 1 LTNYPKS I YS YKI KRNE 



t29.aa 

NNKENIENLIKLHMLYDLTNNL^ 

EI ISLKLRINEDNINEEEIIOtlLNNEKNI 

f29.nt 

ATGAACTGGCTATCCTTTTTTTATGTTTTATTATTTOT 
AAAATATAGAAAATTTAATAAAGCTACATATGCTTTATC 

AAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTA 

aaagaagctaatgattttttaaaaaaaataaaccaaaaa 
o^taaaattaagaataaatgaagataatattaa 
agatgtcaaaataatttatcaaatattcagtcttataaaatto 
ataatactaacaaactatcccaaaagcatttatt^^ 

t29.nt 

aataataaagaaaatatagaaaatttaataaagctacat^ 

tagaaac^taaataaaattaaaaattttgacttagaacaacattatctgot 

aaaaaaatataaagaagctaatgattttttaaaaaaaataaaccaaaaaaa 

gaaatcatttcgctaaaattaagaataaatgaagataatattaatgaag^ 

aaaaaaatatagatgtcaaaataatttatcaaatattc^ 

aattaaaaacataatactaacaaactatcccaaaagcatttattcttataaaataa 

f290.aa 

MNSIWIGKLLLTLFLIFFPFCYNLFAVNLAEINKLSEYAKSIVLIDFDTKRILYSK^ 
ALIEAEKRNIKLKSIWISDSASYYNAPPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSS 
LMNINVLNUSLFNMHFVEPSGYSSENKI^ 
QRNANIilYDYPYSTCIKTGYIKESGLtnjVA 

FPLIVKLKEKVYNGTViyrVALFSKEPFYYILTKDEFDKINI SYTVDKLVAPLSGDMPVGRAMI FLENEKIGDVALF 
SGKVKRLGFWQGLYKSFINLFSREY 

t290.aa 

VNLAEINKIjSEYAKSIVIjIDFDTKRILYSKKPNLVFPPASLTKIVTITO 

PPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSSIAIAEFW 

ITALDMAFFVKSYIEKFKFMI^IHSLKYFIYPKSR^ 

NLVATAKKGERRLIAVVLGVEKGINGFGEKMRSSIAKNLFEYGFNKYSKFPLIV^^ 
YYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAM^ 

f290.nt 

ATGAATAGTATCTATGTTATTGGGAAATTGTTATT^ 

TTGCAGTTAATTTAGCTGAGATTAATAAATTA 

GCGAATACTTTATTCTAAGAAGCCCAATTTGGTTTTTCC 

GCTTTAATTGAAGCTGAAAAGCGAAATATAAAATTAAAAAGCATAGTTC 

ATGCACCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGG 
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TTCAGTTTCTTCGGGTAATGATTCT 

TTAATGAATATTAATGTTTTAAAOT 

ATAAGATTACAGCACTAGATATGGCTTTTTTTC^ 

TTCTTTAAAGTATTrTATTTATCCAAAGAGTAGAAATT^ 

CAAAGAAATGCTAATTTATTAATATATGATO 

GCTTAAATCTTGTTGCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCA 
TAATGGATTTGGAGAGAAAATGAGATCTTCGATTGC^ 

TTTCOTTTAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAG 

CTTTTTATTATATTTTAACTAAAGATGAATTTGATAAAATTAATATAAGTO 

ACTTAGTGGGGATATGCCTGTTGGGAGGGCTATGATTTTTOT 

AGTGGCAAGGTAAAAAGATTAGGGTTTTGGCAAGGTC^^ 

AA 



t290.nt 

GTTAATTTAGCTGAGATTAATAAATTATCAGAGTATGCAAAGTCAATAGTTOT 

TACTTTATTCTAAGAAGCCCAATTTGGTTTTTCCTC 

AATTGAAGCTGAAAAGCGAAATATAAAATTAAAAAGCATAGTTC 

CCCCCCAATTCTTCTTTGATGTTTTTA 

TTTCTTCGGGTAATGATTCTTCTATTGCAATTG 

GAATATTAATGTTTTAAATTTAGGGCTTTTTAATATGCAT^ 

ATTACAGCACTAGATATGGCTTTTTTTGTGAAATCTTATATAG 

TAAAGTATTTTATTTATCCAAAGAGTAGAAATTTAGGAACTGCTTO 

AAATGCTAATTTATTAATATATGATTAC^ 

AATCTTGTTGCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTO 

GATTTGGAGAGAAAATGAGATCTTCGATTGCAAAAAATTTATTT^ 

TTTAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAGTTGCTC 

TATTATATTTTAACTAAAGATGAATTTGATAAA 

GTGGGGATATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATC 

CAAGGTAAAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATC 

f291.aa 

MNSYDFITALVPI ILI IIGLGI IKKPAYYVI PISLIAWAIVIFYKNLGIVNTSLAMLEGALMGIWPI ATVI IAAI 
FTYKMSEDQKDIETIKNILSNVSSDRRI IVLLVAWGFGNFLEGVAGYGTAVAI PVSILIAMGFEPFFACLICLIMN 
TSSTAYGSVGI PITSLAQATNLDVNIVSSEIAFQLILPTLTI PFVLVTLTGGGIKGLKGVFLLTLLSGMSMAI SQV 
FISKTLGPELPAJl^SILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYI 

IjKTFQSTIS IYPEANPLHFKWI I S PGFLI ILATTI SYS IRGVPMLKQLKIFTLTLKKMALSSFI I IC IVAI SRLMT 

HSGMIRDLANGISIITGKFGPLFSPLIGAIGTFLTGSDTVSNVLFGPLQTQMAENIG 

I SPQNITIATTTAGLIGQEGKLLSKTI I YALYYI LATGLLVYLV 

t291.aa 

QKDIETIKNILSNVSSDRRI IVLLVAWGFGNFLEGVAGYGTAVAI PVSILIAMGFEPFFACLICLIMNTSSTAYGS 

VGIPITSLAQATNLDVNWSSEIAFQLILPTLTIPF^ 

ELPAILGSILSMTITIWARFFGNKETTERQSKNTISLSKGI^ 

SIYPEANPLHFKWIISPGFLIILATTISYSIRGVPMUCQLKIFTLTLKKMALSSFIIIC 
ANGISIITCKFGPLFSPLIGAIGTFLTGSOTVSNVLFGPIXJTQMAENIGA 
ATTTAGLIGQEGKLLSKTTI YALYYI LATGLLVYLV 

f 2 91. nt 

ATGAATTCTTATGATTTTATAACAGCTTTGGTACCAATAATC 

CAGCTTACTATGTAATACCCATATCATTAATAGCCACCGTTGCTATAGTT 

AAACACAAGTCTTGCAATGCTTGAGGGCGCCTTAATGGGGATATTO 
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TTTACATACAAAATGTCAGAAGATCAAAAA 
GAAGAATTATAGTATTACTAGTTGCATGGGGATT^ 

TGCAATTCCTGTATCAATATTAATAGCAATGGGATTTGAACCATTTTT^ 

ACOTCATCAACCGCCTACGGATCTGTGGGAATCCCTATAACATCTTTAGCTC 

TTGTTTCATCTGAGATTGCATTCCAACTAATA 

AGGGGGCATTAAAGGATTAAAAGGAGTATTCCTTCTTACCTTACTCTCAGGAATC 

TTTATATCAAAAACTTTGGGTCCAGAACTTCCTGCAATCC 

ATGCAAGGTTTTTTGGAAATAAAGAAACTACTGAGCGCCAAAGCAAAAA 

TGCCTGCTCACCCTACATTTTAATAGTAACTTTTATAGTGCTTGTATCTCCTC 

OTAAAAACTTTTCAAAGCACTATTAGCATTTATCCAGAAGCAAATCCCT^ 

GCTTCTTGATTATACTTGCAACAACAATATC 

TACATTAACCTTGAAAAAAATGGCATTATCTTCCTTTATAATCATATGCA 

CATAGTGGAATGATAAGAGATCTTGCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCAT^ 

TAATTGGAGCTATTGGGACATTTTTAACAGGAAGTGATACGGTT^ 

AATGGCAGAAAATATTGGAGCAAATCCTTA 

ATTTCTCCCCAAAACATCACAATAGCAAC^CAACTGCTGGATTAATTGGACA 
CAATAATTTATGCTTTATACTACATTTTAGCAACAGGATTGCTAGTTTATTTAGTATAA 



t291.nt 

CAAAAAGATATAGAAACTATTAAAAATATTTTATCAAACGTATCTTCTCATAGAAGAATT 

CATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATG^ 

AGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTC 

GTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTT 

AAOTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTA 

AGTATTCCTTCTTACCTTACTCTCAGGAATGTCAAT^ 

GAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTTATGC 

AAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCA^^ 

AGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATG 

AGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTC 

CAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATOT 

ATTATCTTCCTTTATAATCATATGCATTC 

GCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCACTAAT^ 
TAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTG 
TCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAA^ 
CAACAACAACTGCTGGATTAATTGGACAAG 



f296.aa 

MPSPIRVFFLVLLFIFIFNPVLIAMLFILFPFILILFSFIjGWRIYFTRDYSYSRSREFEFYKLSFLL^^ 

GTVTGEQLNYWFIINSLNLSERGKSEL^ 

KNLEAEKFISWGVFLELESIXSYEAYKDINIKIVNPYSVLGLT^ 

ANDKFIKIQDAYEKICKERNIR 

t296.aa 

IYFTRDYSYSRSREFEFYKLSFLLMAKLLSILGTVTGEQLNYVOT 

ILYTLKLGYFQHKDLFIWLFATLKEINRLSRYKNLEAEKFI SYVGVFLELESDGYEAYKDINI KIVNPYSVLGLTY 
SASDDEVKKAYKSLVT KYH PDKFANDPVRQKDANDKFIKIQDAYEKICKERNIR 

f296.nt 

ATGCCAAGCCCAATTAGAGTGTTTTTTTTAGTC 

TTTTTATTTTATTTCCTTTTATTTTGATATT 

CTCATATTCTAGATCTAGAGAGTTTCAATTTTATAAACTTTC 
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GGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATT^ 

O^GAATTGTATACCATTTTTCATTCTGCTATTACTAA 

TGGTTATTTTCAGCACAAAGATCTT^ 

AAAAATTTAGAAGCTGAAT^AATTTATTTCTTATGTTGGTG 

ATAAAGATATTAATATTAAAATTGTAAATCCTTATAG 

TAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGC^ 
GCAAATGATAAATTTATAAAAATTCAAGATGCTT^ 

t296.nt 

ATATACTTTACAAGGGATTACTCATATTCTAGATCTAGAGAGTTTGAAT^ 

OTAAATTGCTATCTATTTTAGGAACTGTAACTGG 

TTTGTCTGAACGTGGTAAATCAGAATTGTATACCATT^ 

ATTTTATATACCCTTAAGCTTGGTTATT^ 

TTAACAGGCTTTCTAGGTATAAAAATTTAGAAGCTC 

ATCTGATGGTTATGAAGCTTATAAAGATATTA 

AGTGCTAGCGATGATGAGGTTAAAAAGGCGTATAAAA 

ATCCTGTAAGACAAAAAGATGCAAATGATAAATTTATAAAAATTC 

AAATATAAGGTAA 

f3.aa 

MKKKNLSIYMIMLISLLSCNTSDPNELTRKKM^ 

ESNFPYYLQEEIEIKEEELVPNTDEEKKAEKAISTC 

AE\mVAGRINSYIKKRKTTKEKEYKKREIKNKIEKQALIKLFNQLLE 

KETLKAAITERLNNKRKNRPWWARRTHSNIA 

KSKIFSSGDRLYDFLETSK 

t3*aa 

NELTRKKMQDKNVKI LGFLEKIQADNKEI VEKH I EKKEKQMVQAASVAP INVESNFP YYLQEEI EI KEEELVPNTD 
EEKKAEKAISIX3SLEFAKLVDDENKLKNESAQL^ 

KKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKAKETL 
IH'HSNIiAIQAKNEAEDALNQI^TSSFRILEAMKIKEDVKQLL 

f3.nt 

ATGAAAAAAAAAAATTTATCAATTTACATGATAATGCTA^ 

AATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAG 

AGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATG^ 

GAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGA^ 

AAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTT^ 

AAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTAT 

GCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAA 

AGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATT^ 

TATTGAAAATCTTCATACTCAATTAAATAG 

AAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATC 
CACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCT^ 
GATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGC^ 
AAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGA 

t3.nt 

AATGAATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTT 

ATAAAGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATGGTC 

TGTAGAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAA 
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GAAGAAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTC^ 

TTAAAAATGAATCTGCGCAATTAGAATCT 

ACAAGCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATAT^ 

AAGAAGAGAGAAATTAAGAATAAGATAGAAAAAGAGGCTCTAATTAAGTTGTTC 

GCGATATTGAAAATCITCATACTCAATTAAATAGTGGACTTAGCGA 

AGCCAAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCA 
AGAAGACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTT 
TTAGGATACITGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTC 
TTCAAAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATT^ 

f30.aa 

MNKKILTLLVLILSI SSVLMLSKSITKKSKYKI IRDYFINSNYVLVKIENKDLKFTI SKPI YDKKLNNYFFKGQTT 
SHFLISNNVDIAINTSPYEVKQNMFFPKGLYIYNKKMI SKQINNYGEIVTKHNKI IIiNPKEDEI ENCDYGFSGFFV 
LI KNGKYKKNFKETRH PRT 1 1 GTDKNNKHLFLVT I EGRGVNN SKGAS LNEAIDFAL S YGMTNAINLDGGGS S TL W 
KSNNAPYKLNFTANIFGQERPVPFHLGIKLPN 

t30.aa 

LSKSITKKSKYKIIRDYFINSNYVLVKIENKDLK^ 
KQNMFFPKGLYIYNKKMISKQINNYGEIVIKHNKIILNPK^ 
IGTDKNNKHLFLWIEGRGVNNSKGASLNEAIDFALSYGMTN^ 
PVPFHLGIKLPN 

f30.nt 

ATGAATAAAAAAATATTAACACTGCTAGTATTGATTTTAAGTATTTCATCAGTACTAATGCTC 

CCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGC 

AGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACOT 

AGCCATTTCTTAATTTCTAACAATGTTGACATT^ 

TCCCAAAAGGACTATAGATATATAATAAAAAAATG^ 

GCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAG 

TTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGA 

ATAACAAGCATTTATTTCTTGTTACAA 

TATTGATTTTGCATTAAGCTACGGCATGACTAACGCTAT^ 

AAATCAAATAACGCTCCTTAGAAATTAAACTTCA 

TAGGAATAAAACTTCCTAATTGA 

t30.nt 

CTGTCCAAATCAATCACCAAAAAATCCAAATACAAAATTATT 

TGAAAATTGAAAATAAAGATCTAAAATTTACCATATCAAAACCTATTTA 

TAAAGGCCAAACAACAAGCCATTTCTTAATTTC 

AAACAAAACATGTTTTTCCGAAAAGGACTATA 

GAGAGATTGTAATAAAGCACAAGJyVAATTATAT^ 

TAGCGGATTTTTTGTTTTAATCAAAAACGGAAAGTATA^ 

ATAGGAACTGATAAAAATAACAAGCATTTATTTCTTGTTAC 

CCTCTCTTAATGAAGCTATTGATTTTGCATTAAGCTACG^ 

AAGCACTCTTGTTGTAAAATCAAATAACG^ 

CCTGTCCCATTTCATTTAGGAATAAAACTTCCTAATTGA 

£308. aa 

MQLLKNKYPFKRAIiI^LFLVYAIVYLASPP^VNVNSEFWNV^ 
IPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFLLEY 
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EELFYRAFVITKFTQMGFPWATAI LSSMFFAYGHLYYGI LGFLVTFILGI FFAFTYLRYKNVYYVI FIHSFYNI I 
VSSLLLFLN 

t308.aa 

NSEIFWNVDENHFYFWISRSFLIIFIIYFFKLTSSYDDFRVEFFIPKFKFIFLWDSVLIFIKTILIA 
LEYLLPESVLVYYFQNNAGFNWKI SSKKAFFLMTFTSFFTGAFEELFYR^ LS SMFFAY 

GHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIHSFYNIIVSSLLLFLN 

f308.nt 

ATGCAATTGTTAAAAAATAAATATCCATTCAAGCGGGCTTTGCT^ 

TGGCATCTCCTTTTGTAAATGTTAATTCAGAATTTTGG 

ATCTTTTTTAATTATTTTTATAATTTA 

ATTCCTAAATTTAAATTTATTTTTCTTTGGG 

TCATTTTTTTAATAGCTTTTTTGCTTGAATATTTC 

TGGATTTAATTGGAAGATTAGCAGTAAAAAAGCATTTT^ 

GAAGAACTTTTTTACAGGGCTTTTGTTATTACTAAGTTO 

TTAGTAGTATGTTTTTTGCTTATGGGCATTTATATTATG^ 

ATTTTTTGCTTTTACTTATTTAAGGTATAAAAATGTATA 

GTTAGCAGCTTGTTGCTTTTTTTGAATTAA 

t308.nt 

AATTCAGAATTTTGGAATGTTGATGAAAATCATTT^ 

TTTATTTTTTTAAACTTACCAGTTC 

TCTTTGGGATTCTGTTTTAATTTTTA 

CTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATT^ 
GTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTOT 

TGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTC 
GGGCATTTATATTATGGAATTTTAGGATTTTTGGTTA 
GGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTAT 
GAATTAA 

f31.aa 

MKKYLFFILFLISSNNLIVSYPLSFGGGFSYQFTNYTDKTGATKFAPNFTRADH 
FV\nBNGRYFSLGLYGTYPM\^KEQVRMLFPLIGFKYAFDLSSNNFNLFFLSMG 
ISPFSNYKNFSGLTTEIMLGFNIGWRFFN 

t31.aa 

^SYPLSFGGGFSYQFTNYTDKTGATKFAPNFTRAD^^ 

YPMVFKEQVRMLFPLIGFKYAFDLSSNNFNLFFLSMGLAAD^ 

MLGFNIGWRFFN 

f31.nt 

ATGAAGAAATATCTTTTTTTTATTTTATTTC 

GTGGAGGTTTTTCTTATCAATTTACTAATTATACTGAT 

AGCAGATCATGGGATTAATTTGAATTTATTT^ 

TTTGTTGTTACTCACAATGGGAGATATTTCTCGCTTG 

TTAGAATGCTTTTCCCATTAATTGGGTTTAAATATGCTT^ 

AAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTC 

ATTTCTCCATTTTCTAATTATAAAAATTTTTCTGC^ 

GATTTTTCAATTAG 
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t31.nt 

ATTGTTTCTTATCCACTTTCTTTTGGTGGAGGTTOT 

CTAAATTTGCTCCAAATTTTACCAGAGCAGATCATGGGATTAATTT^ 

TTTTGAAATGTCTTACAAAGAGGCTTTTGTTGTTACTCA 

TATCCAATGGTTTTCAAAGAGCAGGTTAGAATGCTTTTCCCAT^ 

CTAATAACTTCAATCTCTTTTTT^ 

TATTAGGCCTTTGTTTATGCTTTCT^^ 

ATGCTTGGATTTAATATCGGTTGGAGATTTTTCAATTAG 



f939.aa 



MKQKYENYFKKRLI LNLL I FLLLACSSES IFSQLGNLQKIKHEYNI LGS SS PRGI SLVGETLYIAAMHLFKKENGK 
IEKIDLSNSYEFINDIVNISGKTYLIAQNKEEELEVCELNGKDWTLI^ 

NNREKIFDLQGSDKTPPQATENDKFYQISNEENLITGNSL^ IRGSSEVL 
VMTGGYNNLIXnCFKVYSNTNNYTTP^ 

PGAYNGSQLSKTGLNDI I PVSNNTI YILTQGKGLWKLENRKLTKE 
f939.aa 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAA 

LLAQNKEEELEVCEI^GKDWTLKFKKPLKAYKFL^ 

FYQISNEENLITGNSLKIWQMNNNTYTO^ 

PIFIQDEVGEFSSYFAREFNDAILIGSNNGFAEFTK^^^ 

IYILTQGKGLWKLENR 

KLTKE 

f939.nt 

ATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACC^ 

CAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAA 

TCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCT 

ATTGAAAAAATTGATTTGAGCAATTCTTAT^ 

TAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGA^ 

GCTAAAAGCATATAAATTCTTAAAATCCGTAGGAAGAGATGGCX5TAA 

AATAATCGTGAGAAAATrTTTGATCTACAAGGATCTGACAAAACACCAC 

ATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTC 

AAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAA 

GTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTO 

TATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAG^ 

TAATAATGGATTTGCAGAATTTACAAAAAATAAAG 

CCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTO 
ACATATTAACTCAGGGCAAGGGTTTGTGGAAATO 

t939.nt 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAA^ 

CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTG 

CAAGATTGAAAAAATTGATTTGAGCAATTCT^ 

CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTC 

AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGGAA 

TAAAAATAATCGTGAGAAAATTTTTGATCTACAAGGATCTGA 

TTTTATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTC 
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ACACAAACATAGACTATCAACAOSCCAAA^ 

TTTAGTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTA 

CCAATATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGC 

GAAGTAATAATGGATTTGCAGAATTTACAAAAAATAAAGAAGGAATTTOT 

AGAACCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTT 

ATTTACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAA 

f739.aa 

MQSGLKIKLILFFCCFACSCDINYPEIKELDYKINYYFTENRLDYSMSFDFAIKVINSKDW 
VINNNYSSFFIDSSLGKDILYCKDLRFNFFDKTFEDFTSCVRLFDKGMRVYNRELV^ 
KDMEMLNKLSNSKVFFVKTYKDKLHPVS SWRIDS IDI LEIDKAFDNYI SFYYVEKNSNLFFKVG 

t739.aa 

CCFACSCDINYPEIKELDYKINYYFTENRLDYSMS FDFAIKVINSKDVFKLS IENKNTNEF IQVXNNNYSS FFIDS 
SLGKDILYCKDLRFNFFDKTFEDFTSCVRLFDKGMRVY 

VFFVKTYKDKLHPVSSVVRIDS IDI LEIDKAFDNYI SFYYVEKNSNLFFKVG 
f739.nt 

ATGCAGAGCGGATTAAAAATTAAATTAATATTG 

AGATAAAAGAGCTTGATTATAAGATAAATTATTA^ 

TGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATT 

GTGATTAATAATAATTATAGCTCTTTTTTTATTGATTC 

GGTTTAATTTTTTTGATAAAACTTTTGAAGATTTO 

CAATAGAGAGCTTGTTATTTCTTTGGGTATGTC 

AAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAAGTATTTTTTGT^ 

CGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTCTAG 

TTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGTT^ 

t739.nt 

TGTTGTTTTGCTTGTTCTTGCGACAT^ 
CrcAAAATCGCTTAGATTACTCTATGAGTTT^ 
ATCAATAGAGAATAAGAACACTAATGAGTTTATTCAAGTGATTA 
AGCCTTGGAAAGGATATTCTATATTGTAAGGA 

CATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATACAATAGAGAG^ 

TGATTTAGATGATGTTCACAATTATGTAT 

GTATTTTTTGTTAAAACTTATAAAGACAAACT^ 

TAGAGATTGATAAAGCATTTGATAATTACATAAGTTTTTATTA 

TGGCTGA 

f742.aa 

MNKKHTNFSVLLLLIFLLILSFGGFGYYIYQSKLNDKNR^ 
MLTNTLAEIANSSPFESKDLQRDSANQILDKIKG^ 
DISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSVNDQK^ 
LSKEKNSENILKTPDNSKYSNNNNTTSLIOttSSNSQKK^ 

TLGKNRLKEL I KKGLSNKFQKVNELI ENSKNKEASI^LLTLI KKDI EPNLINI PKDP YKKEI FQLDKEDKK PQYLE 

DIJCSKVHSIKPIDLENTKSRQQAIKDLNEFIJQJNPNDAQASKTLA 

AIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLE^ 

AQANKIQHI^LKSKVHSIKPIDI^NTKSRQQAIKDLNEFXKNNPNDAQ 

DLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQANKIQHLEDLKSKVH 

PNDAQASKTLAQANKIQHLEDLKSKVHS I KPIDLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQAYENNGDLLK 
AENAYEKIIKLTOTQEDHYKLGIIRFKLKKYEHSIESFDQTIKLDPKHKK^ 



WO 98/59071 



PCT/US98/12718 



125 

TABLE 1. Nucleotide and Amino Acid Sequences 

QIDKNYGTAYYQKGI AEEKNGDMQQAFAS I 
AIYNLSIAKFENNKLEESLETINKAIDLNPEKSEYLYLK^ 

YEKSGNKSQAI STLEKI INKNNKLALNNIjGILYKKEKNYQKAIEIFEKAI INSDIEAKYNLATTLIEINDNTRAKD 
LLREYTKIJCPNNPEALHALGIIEYNENNNDOT 

t742.aa 

KLNDKNRE IMLNEVKNSVI DRNYKKA YSVAKLLQDKYPQNED IAMLTNTLAE I ANS S PF ES KDLQRDS ANQ I LDK I 
KGQD 

ottctnvnenfdiafnnryikdstitenysdrnddvgiededisefkkskipekikpn™ 
nix^knlfnlekliocnlsgksnsenil^ 
keselsppsqtiigkiyrpysylikkelyeilddintc^ 
ij^tlikkdiepnlinipkdpykkeifqldkedra 

daqasktlaqankiqhledlkskvhsikp idlentksrqqai kdlneflknnpndaqasktlaqankiqhledlks 
kvhs i kp i dlentksrqqai kdlneflknnpndaqasktlaqankiqhledlkskvhs ikp idlentk kd 
lnefxknotndaqasktiaqankiqhledlk^ 
kiqhledlkskvhsikpidlentksrqqaikd^ 

tksrqqaikdlnefxknnpntxaqasktlaqayenngdllkaena i kltntqedhyklgi i rfklkkyehs i e 

sfix^tikldpkhkkalhnkgxaiimmlnknkkaiesf 

dknpnyalkagivsnnlgnfkqseeylot 

ylkas inlkkenyqnai slyslvi eknpentsayinlakayeksgnksqai stleki inknnklalnnlg i lykke 
knyqkai eifekai insdi eakynijvttlieindntrakdllreytklkpnnpealhaix5ii 
ikkfpnykknenikkiigi 

f742.nt 

ATGAATAAAAAACATACAAATTTTTCGGTATTATTGCTT^ 

GTTACTATATATATCAAAGCAAATTAAATGAGAAAAATC 

AGATCGAAACTATAAAAAAGCATATTCTGTTGCAAAACTTC 

ATGCTTACAAATACACTAGCAGAAATTGCCAACAGTAGTC 

ATCAAATCTTAGACAAGATCAAAGGTCAAGACAATA^^ 

TAATAGATACATTAAAGACAGCACAATAACAGAAAACTACTCTGACAG 

GACATATCTGAATTTAAAAAAAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAA 

TAATACAATCTCCAAATCCGAAATTAAGTGTTAATGACGAAAAAAA 

TTTAAGTGGAAAATCAAATAGTGAAAATATTTTAAACGATTC 

ttatccaaagaaaaaaattcggagaatattttaaaaactc 
catctttaaaaaaaatttcttcaaattcc^^ 

AATTTATAGGCCATATAGCTACTTGATAAAAAAAGAGCTCTATGAAATATT 

ACACTTGGAAAAAACAGATTAAAAGAATTAATTAAAAAAGGTC 

TTGAAAATTCAAAAAATAAAGAAGCTTCAAATTTACT^ 

TAATATACCAAAAGATCCTTACAAAAAAGAAATTTTTCAATTAGATAAAG 

GACCTTAAATOTAAAGTTCATTCAATAAAACCCATTGATC 

ATCTAAACXSAATTCTTGAAAAACAATCCCAATGA^ 

ACA(XrTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAA 

GTCCATTAAGGATCTAAACX3AATTCTTGAAAAACAATC 

ATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCAT^ 

ACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAAT^ 

GCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCA 

ACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTC^ 

AAAOTTTAGCTCAAGCTAATAAAATACAACACCTGAGGACCT^ 

CTTGAAAACAGAAAATCACGCCAACAAGCCATTAA^ 

GCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAATC 
CCATTGATCTTGAAAACACAAAAT 

CACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATC 

AGCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTC^ 

AAGAGAAAATCACGCCAACAAGCCATTAAGGATC 
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TAAAACTTTAGCTCAAGCTTATGAAAACAATGGAGATTTGCTAAAA 
CTCAGAAATACCCAAGAAGATCACTATAAACTT^ 

AATCATTTGATCAAACAATAAAACTCGACCCAAAACATAAAAAAGCACTTCA 

GCTAAATAAAAACAAAAAAGCAATAGAATCTTTTGAGAAAGCAATACAAATO 

TACCAAAAAGGAATAGCAGAAGAAAAAAATGGCGATATGCAAC^^ 

TCGACAAAAACCCCAATTATGCATTAAAAGCAGGAATA^ 

GTATTTAAATTTTTTTAATGCCAATGCAAAAAAACCTAACGAAATTGCT^ 

GAAAACAATAAACTTGAAGAATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATC 

TATATTTAAAAGCATCTATAAATCTTAAAAAAGAA 

AAAAAACCCTGAAAATACTTCAGCCTATATAAACCTGGCAAAAGCATATGAAAAATC 

ATCTCAACTCTTGAAAAGATAATAAACAAAAATAATAAA 

AAAAAAATTATCAAAAAGCAATTGAAATTTTTGAAAAAC^ 

TGCAACCACTCTAATTGAAATTAATGATAACACAAGAGCTAAAGACCT 

AACAATCCAGAGX3CCTTACATGCACTAGGAATAAT 

TTATAAAAAAATTTC CAAATTAC AAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 
t742.nt 

AAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAATAGATCGAAA 

CATATTCTGTTGCAAAACTTCTGCAAGACAAATACCC^^ 

AGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGA 

AAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTC 

GCACAATAAGAGAAAACTACTCTGACAGAAACGATC^ 

AAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAATAATACAATC 
AAATTAAGTGTTAATGACCAAAAAAATTTATTTAAT 

GTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGCAAAACACAAATTT 
GGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTACATCT 
TCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGG^ 
ACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGAT^^ 

AAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAAGTAAATGAAT^ 

GAAGCTTGAAATTTACTATTAACCTTAATAAAAAAAGA 

ACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGC(^^ 

TTCAATAAAACCCATTGATCTTGAAAACACAAAATCAC 

AACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAA 

CTAAGGTTCATTCAATAAAACCCATTGATCTT^ 

ATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTT^ 

GACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATC 

atctaaacgaattcttaaaaaacaatcccaatgacgc^ 

ac^cctggaggaccttaaatctaaggttcattcaataaaaccca™ 

gccattaaggatctaaacgaattcttaaaaacaatcccaatgacgcc^ 

AAAATACAACACCTGAGGACCTTAAATCTAAGGTTC^T^ 

CCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATC 

AGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTGATTC 

AAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATC 

CTTTAGCTCAAGCTAATAAAATAC 

AACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATC 

AGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATC 

ATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAA 

TCACrATAAACTTGGAATCATTAGATTCAAGCTTAA 

AAACTCGACCCAAAACATAAAAAAGCACTTCATA 

CAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCAC^ 

AGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTA 

GCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGAGTAT^ 

CCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAAT^ 

ATCTCTTGAAAC^TAAACAAAGCGATAGATTTAAATC 

AATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTTAGTAA 
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GAGCCTATATAAACCTGGCAAAAG^ 

AATAAACAAAAATAATAAATTAGCCTTAAACAATCT^^ 
ATTGAAATTTTTGAAAAAGCAATAATGAAT^ 

TTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAAC 

TGCACTAGGAATAATAGAATATAATGAAAATAACAATGA 

ACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

f743.aa 

MRIYLFLNKNYKIFILFLILII^SKLAYSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYP 
NQLTEAEGALIJX3IAVGGEIDYILYYELGNIM 

KEKEYQKAWDSYTMAIHDYSQFI TLRSKTEKKDS I LL 1 1 S YLRNEKINLEQLDKSLKGRTEHI VYAKEDKNQI LKD 
SFKDNLETNSLIELEKLNWQEELYIDE 

t743.aa 

YSQRLIRIGKEEMKNKNYIQAI ETLSDAIKKYPKVQLGYYFLS I AYRENNQLTEAEGALLDGIAVGGE IDYI LYYE 

LGNIMFmGEGYYPLAIKYYSNSIKSRPNYDSALLNRANAYVQQGKITS^ 

KTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIW 

f743.nt 

ATGAGGATTTATTTATTTTTAAATAAAAATTACAAGA 
TGGC 

ATATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAA1T 

AGTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAAT^ 

AACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATC 

ATTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACT^ 

AGTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCT^ 

AAAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTC 

AAAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATT^ 

GACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAA 

TTAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACOT 

ATAA 

t743.nt 

TATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAA^ 

GTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACT^ 

ACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAA™ 

TTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACT 

GTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTC 

AAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTC 

AAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATTTAAGAAATC 

ACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAA 

TAAAGACAACCTAGAAACTVAATTCTTTAATTGAGCTAGAAAAACT^ 

TAA 

£748. aa 

MKFIINLLLSTIKIITFT\niVCLT 

RWDGEPQRI PTGGEEKQL IWI m^ARWKI ADINKFYTTIKTMSRAYVRIDAAI EPAVRGVI AKYPLLE 1 1 RS SNDP 
IQRLSNGILTPQETKINGIYKITKGRKIIEKEIIRIANNNTKDIGIEIVDVLIRKV^ 

I AEEQRS IGLAEKTE I LGS I EKEKLKI LSEAKATAAKIKAEGDREAAK I YSNAYGKNI EFYKFWQALES YKAVLKD 
KRKIFSTDMDFFQYLHKRN 
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t748.aa 

IFQPIYILKENEISITTRLGKIQRTENI^ 

KIADINKFYTTIKTMSRAYVRIDAAIEPAVRGVIAKYPLLEIII^ 
IIEKEIIRIANNNTKDIGIEIVE^njIRKVTYDPSLIESVNNRM 

LSEAKATAAKI KAEGDREAAKI YSNAYGKNI EFYKFWQALESYKAVLKDKRKI FSTDMDFFQYLHKRN 
f748.nt 

ATGAAATTTATAATAAATCTTTTATTATCT^ 

TGTCTATTTTCCAGCCAATTTATATTTTGAAAGAAAA 

AACTGAAAATTTAGCTGGACTTAAATATAAAATACCATTAATTGAAAATC 

AGATGGGATQ^GAACCTCAAAGAATCCCAACAGGA 

GATGGAAAATTGCAGACATAAATAAATTTTACACAA 

AGCAATTGAACCTGCTGTTAGGGGGGTTATTGC^ 

ATTCAACGTTTGTCTAATGGAATACTCACCCCACAAGAAACAAAAATTAACGGTATTTAT 

GAAAGATAATCGAAAAAGAAATAATTCGTATAGCAAACAACAATAC^ 

ACTAATAAGAAAAGTTACOTATGACCCAAGCCTTATTGAATCTGTAAACAA 

ATCGCAGAAGAACAAAGAAGCATAGGATTAGCTGAAAAAAC^ 

AAATATTAAGTGAAGCAAAAGCCACTGCTGCAAAAAT^ 

AAATGCATATGGCAAAAATATTGAATTTTACAAATTCTGG^ 

AAAAGAAAAATTTTCTCAACAGACATGGATTTC 

t748.nt 

ATTTTCCAGCCAATTTATATTTTGAAAGAAAATGAAAT^ 

AAAATTTAGCTGGACTTAAATATAAAATACCATTAATTGAAAATC 

GGATGGAGAACCTCAAAGAATCCCAACAGGAGGGGAA 

AAAATTGCAGACATAAATAAATTTTACACAACAATAAAAACAATGAGTAGAGCT^ 

TTGAACCTGCTGTTAGGGGGGTTATTGCAAAATACCCTTTGCTTGAAATO 

ACGTTTGTCTAATGGAATACTCACCCCACAAGAAAGAAAAATTAACGGTATTO 

ATAATCGAAAAAGAAATAATTCGTATAGCAAACAAG^ 

TAAGAAAAGTTACTTATGACCCAAGCCTTATTGAATC 

AGAAGAACAAAGAAGCATAGGATTAGCTGAAAAAACAGA 

TTAAGTGAAGCAAAAGCCACTGCTGCAAAAATAAA^ 

C^TATGGCAAAAATATTGAATTTTACAAATTCTGGCAGGCATTAG 

AAAAATTTTCTCAACAGACATGGATTTC 

f764.aa 

MSGPKKLAIIALLVISIQGCKESSIIEKQFNYAIIFSDATEYFFEIQTTPFIKNEILFINDKNLEI 

ILLTHKSNNEILNNEILKEKIFYLSKIKFSLKKSI^ 

TLBJEKNISYIQTFITSQIKTIILFSL^ 

IAKDVEKDFQYEFNIYKQ 

f764.aa 

EKQFNYAIIFSDATEYFFEIQTTPFIKNEILFINDKNLEI^ 

IKFSLKKSIDFLLNEKSIDLQKTLLFRDKSLN^ 

LRM^IILKKILNSPFSKNIKFVLIGNTRKDLKIIKLKYIITLKEPD 

f764.nt 



ATGTCTGGCCCTAAAAAACTTGCTATAATAGCGCTCTTAGTAATTTCA 
TTGAAAAACAATTTAATTATGCAATAATTTTTTCA 

CATAAAAAACGAAATACTATTTATAAATGACAAAAATTTAGAAATTATAAAAGACAAGCTT 
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ATACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAA^ 

AAATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTG^ 

ATTTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAA 

ACTCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTO 

CTTTAAGAGATAATAATATTATTTTAAAAAAGAT^ 

TGGCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGA 
ATAGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTO 

t764.nt 

GAAAAACAATTTAATTATGCAATAATTTTTTCAGATGCAACTC 
TAAAAAACGAAATACTATTTATAAATGACAAAAATTT^ 

ACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAAGA 

ATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTT 

TTAGAGAGAAATCTCTAAATAACGAAGACCTTGAATACTTGGA 

TCTAATAAACGAAAAAAACATATCCTATATTC 

TTAAGAGATAATAATATTATTTTAAAAAAGATACTAAA 

GCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGC 
AGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTT 

f770.aa 

MINFSKSFPYPLPIGKIFVLSGDMGSGKTSFLKGLALNLGISYFTSPTYNIVNVTO 
ELVGGLEILMDLDS I IAIEWPQI ALS I VPKDRLFSLTFKIVGSGRWELNG 

t770.aa 

KTSFLKGLALNLGISYFTSPTYNIVNVYDFINFKFYHIDLYRVSSL^^ 
VPKDRLFSLTFKIVGSGRWELNG 

f770.nt 



CTGGAAAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTT^ 

TGTTAATGTTTATGATTTTATAAATTTTAAATTTT 

GAGCTTGTTGGGGGATTGGAAATACTTATGGA 

GCATTGTTCCAAAAGATAGATTATTTTCTTTAACTTl^ 

TTAA 

t770.nt 

AAAACTAGTTTTTTAAAGGGACTTGCCCTTA^ 
ATGTTTATGATTTTATAAATTTTAAATTTTATCAT^ 
TGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATT 
GTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAAT^ 

f790.aa 

Mim^TTPLLLLFLIQSLAFSSEIFEFKYIKGSKFRLEGTD^ 

AFFRILKRENINEPYLLNEEFEEIFSVNKQGEYTIGANQKRPSWGIPRFPKTPIKINEKWSYLAEE^ 

SIKDFVVKFNVNYEYKGKEEHNGKHYHI ILSNYESQYNVKNI SFYQKVDQKI YFDNEIGNTYKYSDKYIFEINQNN 

NQHFKMIGNSLGRIVSIELPNDNLIETEVENYIREKKIKAIE^^^ 

AKLLEKFKKNNILIEGHTEQFGLEEEMHELSEKRARAIGNYLIKMKVK^ 

RVEITILNN 



t790.aa 
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SEIFEFKYIKGSKFRIJSGTDNQKIYFN^^ 
EEIFSVNKQGEYTIGANQKRPSVRGIPRFPKTPIK 

NGKHYHI I LSNYESQYNVKNI SFYQKVDQKI YFDNEIGNTYKYSDKYI FEINQNNNQHFKMIGNSLGRI VS IELPN 
DNLI ETEVENYIREKKIKAI EVEKNNKGINLSFDI EFYPNSFQILQKEYKKIDLIAKLLEKFKKNNI L I EGHTEQF 
GLEEEMHELS EKRARAIGNYLI KMKVKDKDQI LFKGWGSQKPKYPKSS PLKAKNRRVEITI LNN 

f790.nt 

ATGAATACCAAGGCGACTACACCATTGTTGTTATTATTTTTAATTC 

AATTTAAATACATTAAAGGTTGAAAGTTTAGATO^ 

TAATTCAAGCTCTAATACC^U^TATTCA 

GCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTAC 

GCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTC 

AAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAA 

AGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGA 

ACCACATAATTCTTTCGAATTATGAATCACAATAC 

AATTTATTTTGATAATGAAATTGGCAATACATATAA^ 

AACCAACATTTTAAAATGATTGGAAACTCTCTTG 

AAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTGAAAAAAACA^ 

TTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATAC 

GCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACT^ 

AGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTT 

CCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTC 

CGAGTAGAAATTACAATATTAAATAACTAA 

t790.nt 

TCTGAAATCTTTGAATTTAAATACATTAAAGGTTCAAAGTTTAGAT^ 

TCAATGGCCArTATAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTC 

TGCAAGCATTAAAGCTTTTTTTAGAATCTTA^ 

GAAGAAATCTTCAGCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGAC 

TTCCAAGATTCCCAAAAACACCAATCAAAATAAATGAAAAATGGT^ 

AAAAATAGACAAAAGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACT 

AATGGCAAGCATTACGACATAATTCTTTCGAATT^ 

AAGTAGACC^AAAAATTTATTTTGATAATC 

AAATCAGAACAACT^CCAAC^TTTTAAAATGATTGGAAACTOT 

GATAATCTTATTGAAACTGAGGTTGAAAATTACATCCGAGA 

ATAAAGGTATTAATTTAAGCTTTGACATTGAATTOT 

AATTGACCTTATAGCTAAACTTCTTGAAAAATTTAAAAAAAATAACAT^ 

GGATTGGAAGAAGAGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATT^ 

TAAAAGACAAAGACCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAA^ 

GGCTAAAAATAGGCGAGTAGAAATTACAATATTAAATAACTAA 

f792.aa 

MKIFIYWWIFFFSVFKVFSIYSLTDEEFFKKYSLFFVHKGFLSKNVNGKITKVQVNGINSRV^ 
SIYEDVYSSSSFLTTSNNLWSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKR 
KNI*NRLIPQIYLGAGYYDIISAIEFSKEETNNLYFS 
KILWTYDNHFYSYINGQWVFIGKLSLQIX>D^ 

VIDFKDDNGNLTYSSKLSLPNKLRSVKNF IDVPYI LKKAKELGI WIJ^CVVFKDSKLYYYDNFKHALWNKKTNKP 

WAHLIKKVDSSGLVKWQVEHWVDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSIX^ 

VDALESFLIMAREQLYVPISVDIYGYNGWFPTN^^ 

YKEGSDRAI^SIJXJWIRPWQAFLIX5KERLVDDEIYLEYI^ 

LDSF 
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t792.aa 

IYSLTDEEFFKKYSLFF\mKGFLSKNVNGKI^ 

VSYDYSKNFRKIiVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGI YLSVNGAISFKNLNRLI PQIYLGAGYYDI I 
SAIEFSKEETONLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKNVEKILVRTYDNHF^ 
FIGKLSl^DQDFFEKSQRMQLAKNKGSIYLTAYTLRNKKAVDERFKFIKDSGMNAV^ 
NKLRSVKNFIDVPYILKKAKEIX3IWIARCVVFKDSKLYYYD 

HWVDIFSPATWETraiSIAKEIQSFGVDEIQFDYIRFPSIX5PVSIAISRMNKYEMQPTO 

VDI YGYNGWFPTNS IGQNI SMLSDYVDVI S PMFYPSHYTDDFLPSNFYYTKRAYRI YKEGSDRALAFSLDGVVIRP 
YVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLWNASNVYYMIKGSLKEYLDSF 

f792.nt 

ATGAAAATTTTTATCTATTGGGTAGTTATTTTC 
ATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTC 
AACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTT^ 
TCTATTTATGAGGATGTTTATTCTTCA^ 

CAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGG 

AGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGT^ 

AAAAATTTAAATCGTTTGATTCCGCAGAT^ 

CAAAAGAAGAGACAAATAATTTATATTTTTCCTCT 

ACTTATTAAAAAAATATCTTTTCCTTTCAAAAAGC 

AAAATTTTAGTCAGAACATATGACAATCATTTTTATTC 

CTTTGCAGGATCAGGATTTTTTTGAAAAA^ 

AGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTAT^ 

GTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCT^ 

TTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAA 

TGTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAAC^ 

TGGGCTCATTTGATTAAAAAAGTTGATTCT^^ 

TTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAA 

TGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCA^ 

GTTGATGCACTTGAATCTTTTTTGAT^ 

ACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAA 

TATGTTTTATCCTTCGCATTATACTGATGATTT^ 

TATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTA 

TATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTA 

GTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCT 

TTAGATTCTTTTTA 

t792.nt 

atatattcattaacx:gatgaagaattttttaaaaaatat^ 

atgttaatgggaaaataaccaaagttcaagtcaatgggataa^ 

tcctagtcgaattacttotatttatgaggatgtttattc 

GTTTCTTATGATTATTCAAAAAATTTTAGAAAATT 

CTAGTGCCTTTTCTCAAGGAGATTACAAGCGTATTGCTATTGGA^ 

TGGAGCTATTAGTTTTAAAAATTTAAATCGTTTGATTC 

AGTGCTATTGAATTTTCAAAAGAAGAGACAAATAATT^ 

TTAGTCAGAAAAGTGGATTTATTAAAAAAATATCTTTTCCT^ 

TAGTAAGAATGTAGAAAAAATTTTAGTCAGAACATAT^ 

TTTATTGGAAAATTATCTTTGCAGGATCAGGATTT^ 

GGTCTATTTATTTAACAGCATATACATTGCGT 

AGGTATGAATGCTGTTGTAATTGATTTTAAAGATGATAATG^ 

AATAAGTTGAGATCTGTTAAAAACTTTATTGATGTTC 

TTATTGCTAGATGTGTTGTATTTAAAGATTCAAAATTGTATTATT 

AAAAACCAATAAACCTTGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTC 
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CATTGGGTAGATATTTTTTCTCC 

TTGACGAGATACAATTTGATTATATTAGATTTCCATCAGA 

GTATGAGATGCAACCCGTTGATGCACTTGAATCTTTTTT^ 

GTTGATATTTATGGGTACAATGGCTGGTTTCCTAC 

TTGACGTCATATCTCCTATGTTTTATCCTTCGCATTATACTGA 

AAGAGCTTATAGGATTTATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTC 

TATGTTCAAGCTTTTTTATTAGGAA^ 

TTAAAGGAATTAAAGAGTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCT 
TAGTTTAAAAGAATATTTAGATTCTTTTTAA 

f797.aa 

MSIKKFILTLIILSLAKNSFSENEINIFEaOENYIVKENIKTEIKKLKQS 

LEGISYSFINVFSKIGSSAIISFDLSNEASKKYKIIKLEFLSPDK^ 

RTESLSKTIAEYYKDNNWYYILAAITVENNINKETEK^ 

t797.aa 

KNSFSENEINIFENENYIVKENIKTEIKIO^KQSFL 

SSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTLRTESLSKTIAEYYK^ 
NWYYILAAITVENNINKETEKYEIRINPKIYNDFQKKLRLHFKSNQIKKFPIPIIE 

f797.nt 

ATGAGCATTAAAAAATTTATTTTAACCTTGATAATTC 
ATATCTTCGAAAACGAAAATTATATTGTAAAAGAAAAT^ 

ACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGCAGATTTA 

CTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAA^ 

CAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAAT^ 

GCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGA 

AGAACTGAATCTCTTTCAAAAACAATTGCAGAATATTACAAAGATAACAACTGG 

CAGTAGAAAATAATATAAATAAAGAAACTGAAAAATACGAAATTAGAATTAACCCTAAAATATATAATC 
AAAAAAATTGAGATTAC^TTTTAAAAGCAACCAAATAAAAAAATT^ 

t797.nt 

AAAAATAGCTTTTCTGAAAACGAAATTAATATCTTC 

AAATTAAAAAACTAAAACAAAGTTTTTTACT^ 

AGATTTAAATGGAGAACCGATAAAAGAACTTGAAGGGATO 

TCTTCTGCTATTATTTCATTTGACOT 

GTCCAGATAAAGGCAATTTTATTAACCAGCTAAGCAGC^ 

AAAAGACGCTTACTCATTTGGTACATTAAGAA 

AACTGGTATTATATTTTAGCAGCAATAACAGTAGAAAATAATATAAATAAAGAAACTGAAAAATA 

TTAACCCTAAAATATATAATGATTTTCAAAAAAAATTGAGATTACA 

AATACCCATTATAGAATAA 

f799.aa 

MKKHIIIGIIFVAILLFFKILLIPRIQiraE 

IESNLLYISSKNFTTYANIIYQNESLLSI ILKSNGNNNVFYSKRIKPRGKI 
t799.aa 

HENNKNNIKMIISYKQDKNRLSLKINIKTKKTTNLGKAKLD 
I ILKSNGNNNVFYSKRIKPRGKI 
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f799.nt 

ATGAAAAAACATATCATTATTGCX3AT 

TTCAAAATCACGAAAATAATAAAAATAATATCAAAATGAT^ 

AAAGATAAACATAAAAACAAAAAAAACTACCAACCTGGGAAAAGCCAAACTAGATATTTATCTAGACAGTAAATTA 

ATTGAAAGCAATTTGCTTTATATAAGCAGCAAAAACTTTACAACAT^ 

TATTAAGTATAATATTAAAGAGTAATGGCAATAATAATGTCTTTTATAGTAAAAGA 

ATGA 

t799.nt 

CACGAAAATAATAAAAATAATATCAAAATGATAATAAGCTACAAG 

ACATAAAAACAAAAAAAACTACCAACCTGGGAAAAGCCAAACTAGATATTTATCTAGACAGTAAAOT 
CAATTTGCTTTATATAAGCAGCAAAAACTTTACAACATATGCTAAT 

ATAATATTAAAGAGTAATGGCAATAATAATGTCTTTTATAGTAAAAGAATAAAACCTAGA 
f800.aa 

MKKHYKALI LSLLFAI I SCNTKTLNELGEEQFKI PFGTLPGAIMPLNNKFTNSKFDIKTYNGLVYIAEIKTNKLMI 
FNSYGKLIQTYQNGIFKTNPDLKIKKIDFEGIQAIYPL^ 

EILGQEGLNGMPFPQI YDVNVDENGNI AI I SI YSEGYI I YSYNKEFSPLYKI YVNKNLLKTIDNQKKKYNISIDKV 

FFEVNKKTLYVKTTYYENIGDNENINDLGIKIK^ 

DKIIASTNMKNLSNNLIWKLDSKGSIKEQ 

t800.aa 

KTLNELGEEQFKIPFGTLPGAIMPLNNra^ 

LKIKKIDFEGIQAIYPLKDFIIVADKLNNKKSK^ 

DENGNIAJISIYSEGYIIYSYNKEFSPLYKIYVNKNLLCT 

NENINDLGIKIKDQYIYKMSLKKNKELEVINKIAL^ 

SKGSIKEQIALIEPPNI^LSESLSKIXSILSILYGGK^ 

fBOO.nt 

ATGAAAAAAGACTATAAAGCTCTTATATTAAGCT^^ 

AATTAGGAGAAGAACAATTTAAAATACCATTT^ 

AAATTCAAAATTTGACATCAAAACX5TATAACX3GGCTAGTG 

TTCAACTCATACGGAAAACTAATACAAACATATCAA 

AAATAGATTTTGAAGGAATTCAAGCAATATACCCACTAAAAGATTT^ 

AAAATGAAAATTCAACGAAAAAGAGAATATTGC^ 

GAAATTTTGGGTCAAGAAGGTTTAAACGGAATGCCATTTC 

ACATTGC^TAATATCAATATATAGCGAAGGATATATAATATATTCTT 

AATTTACGTCAACAAAAACCTGTTAAAAACAA^ 

TTTTTTGAAGTCAACAAAAAAACTCTTTATC 

ACGATCTTGGAATTAAAATTAAAGATGAATATA 

AAATAAAATTGCTCTTCCTAAAAACTTACTAGAT^ 

GACAAAATAATAGCATCTACTAATATGAAAAATTTATCTAACAATTTAA 

TTAAAGAACAAATAGCTTTAATTGAGCCTCCAAATTTA^ 

TAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTT^ 

t800.nt 

AAAACTTTAAACGAATTAGGAGAAGAACAATTTAAAATACCATTT^ 

ATAACAAATTTACAAATTCAAAATTTGACATCAAAATO 

TAAATTAATGATTTTCAACTCATACGGAAAACT^ 

TTAAAAATAAAAAAAATAGATTrTCAAGGAATTCAAGCAATATACCCACTAA 
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AACTAAATAATAAAAAATCAAAATTCAACCAAAAAGAGAAT^ 

AAACTCATCTGTAGAAATTTTGGGTCAAGAAGGTTTAAACGGAATGC 

GATGAAAATGGCAACATTGCAATAATATCAATATATAGCGAAGGATATATAATATATTC 

CCCCGCTTTATAAAATTTACGTCAACAAAAACCTGTT^ 

AATAGATAAGGTTTTTTTTGAAGTGAAC 

AATGAAAATATAAACGATCTTGGAATTAAAATTAAAGATCAATATATC 
AATTAGAAGTGATAAATAAAATTGCTCTTCCTAAAAACTTACTAGATC 
AAAAATACAAAAAGACAAAATAATAGCATCTACTAATATC^ 
AGCAAGGGCTCAATTAAAGAAGAAATAGCTTTAA 

AAGATGGAATACTTAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTTT 
AAAATTATAA 

f810.aa 

MYKLFLFFIIFMFLSCDEKKSSKNl^SVKIGYVNWGGETAATNVLK^ 
GTVSSWVPTADKFYYEKLKTKFVDLiGANYEGTIQGFVVPSYV 

ALNYYGLSKEYELVPSSESVMIASLDSS IKRNEWI LVPLWKPHWAFSRYDI KFLDDPDLIMGGI ESVHTLVRLGLE 
NDDFDAYYVFDHFYWSDDLILPLMDKND^ 

t810.aa 

CDEKKSSKNLKSVKIGYVNWGGETAATNVLK^ 

EKLKTKFVDIX5ANYEGTIQGFVVPSYVPI SSI SELKGKGDKFKNKMIGIDAGAGTQI VTEQAIiNYYGLSKEYELVP 

SSESVMLASLDSSIKRNEViHILWLWKPHWAFSRTO 

SDDLILPLMDKNDKEPGKEYRNAVEFVEKNKEIVKTWVPEKYKTLFD 

f810.nt 

ATGTATAAATTATTTTTATTTTTTATTATTTTTATC 

AATGGGCTACAATGKIAGAAATATTTTCAGTTA 

GGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATO 

GTGCAAATTATGAAGGAACCATTCAAGGTTTTC 

GGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGC^^ 

GCGCTTAATTATTATGGATTAAGTAAAGAGTAra 

ATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTC 

TAAGTTTCTTGATGATCCTGATTTAATTATGGGG 

AATGATGATTTTGATGCATATTATGTTTT^ 

AAAATGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTGAATTTC 
GTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 

t810.nt 

TGTGATGAAAAAAAGAGTTCAAAGAATTTAAAATCGGTAAA 

CTACAAATGTATTAAAGGTTGTTTTTGAGAAAATGGGCTACA^ 

GTATCAATACTTAGCATCTGGAAAGATAGACGGTAC^ 

GAAAAACTGAAAACAAAGTTTGTTGATCTTGGTGCAAATTA C AAGCT 

ATGTTCCAATTTCCAGCATTAGTGAGCTTAAGGGTAAAGGT^ 

TGGTGCGGGAACTCAAATTGTTACAGAACAAGCGC 

TCAAGTGAGAGTGTTATGCTTGCAAGTTTAGATTCT^ 

AGCCTCATTGGGCTTTTTCTAGGTATGAT^ 

CGTGCATACTCTTGTTAGACTTGGTCTTGAAAATGAT^ 

AGCGATGATTTAATATTGCCCTTAATGGATAAAAA 

TTGTTGAAAAGAATAAAGAGATTGTAAAGACGTGGGTTCCAGAAAAATATAAGACCTTAT^ 



£814. aa 
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MLVKRIVGKPITMLILFSLLLMISLYTFS 

KNIYSVSSKESSWSLEFTHGTDLDLVLNEIRDALELVKSS 

RYADEIIKPGLERLDGVAIVTVNGGSKKRVLIEVSQNRLESYGLSK 

SGKFKSIEEIGNVVTAYKIPDISSGINLSPIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVS 

NWMNEIEKLKLSMPKDMKLEIASDSTDFI 

FCLMYFVNISLNIMSLAGLALGIGMVVDCSIWIDNIYKYRQK^ 

LIFKSELGVYGDFFKDFTFTIVISIX^SIJ^VAIFLVPVLSSH^ 

YINLLNIVLNHKLIFGLrVFFSFIGSLLLGLLLDVTTFTO 

KGYKSIIATIJIADRITFNVLFPLKEESRDNLTQSVDYDSIKYKIMNRIGNLYPEFNIE 

SANDFEYIKDYGKI LVSMLKKEI PELVNPRLS I SDFQLQIGVEIDRALVYNYGIDMOTILNELKANINGWAGQYV 
EKGLNYDIVLKLDRMDVKNLKDLEKIFITO 

VTAKVVDFINNKVPHKEGITLKVEGEYNEFSNIMNQFKI I IMMAI IWFGIMASQFESFLKPFI I IFTI PLTAIGV 
VL IHFLAGEKLS I FAAIGMLMLVGWVNTG I VLVDYTGLLIKRGFGLREAI I ESCRSRLRPI LMSSLTS I IGLI PM 
AFSSGSGNELLKPIAFTFIGGMTASTFLTLFFIPMLFEIFPTCFKFQI 

t814.aa 

RLKVDLLPGIDI PQI SIHTVYPGASPREVEESVSRVLESGLSSVKNLKNI^ 

EIRDALELVKSSLPSKSQTPRIFRYNLKNI PVMEIVINSVRPVSELKRYADEI IKPGLERLDGVAIVTVNGGSKKR 
VLIEVSQNRI^SYGLSLSRISSIIASQNLELSAGNILENNLEYLVEVSGKFKSIEEIGNWIAYKIPDISSGINLS 
PIEIKLKDIANIKTDFEDLSEYVEYNGLP 

IKASI STWNSAYFGAMLAI FVT FFFLRSFRATI I IGIS I PI AI VLTFCLMYFVNI SLNIMSLAGLAIXSIGMVVIX? 
SIV^DNIYKYRQKGAKLISSSI3JGAQEMMLPITSSTFTSICVFGPFLIFKSELGVYGDFFro 
LVAIFLVPVLSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFLYIN^^ 
GLIiDVTTFTRGKENSITINLNFPHKTNLEYA 

NLTQSVDYDSIKYKIMNRIGNLYPEFNIEPSISGNALGGGDSIKIKI SANDFEYIKDYGKI LVS 

RLSISDFQLQIGVEIDRALVYNYGIDMNTILNELKANINGWAG^ 

NSSGVKIPFSSIATFEKTNKAESIYRENQALTIYLNAGISPDDNLTQW^ 

FSNIMNQFKI I IMMAI I WFGIMASQFESFLKPFI I IFTIPLTAIGWLIHFLAGEKLS IFAAIGMLMLVGVWNT 
GIVLVDYTGLLIIQIGFGIjREAI I ESCRSRLRPI LMSSLTS I IGLI PMAFSSGSGNELLKPIAFTFIGGMTASTFLT 
LFF I PMLFEI FPTCFKFQI 

f814.nt 

ATGTTGGTAAAGAGAATAGTTGGCAAACC^TAACAATGTTG^ 

ATACCTTTTGAAGATTAAAAGTAGATCTTTTC^ 

TGGCGCTTCTCCTAGAGAAGTTGAAGAGAGTGTTTCTA 

AAAAATATATATAGTGTATCTTCCAAAGAAAGCAGC^ 

TGGTTTTAAATGAAATTCGAGATGCTCTTGAATTGGTAA 

TTTTAGATACAATCTTAAAAACATCCCTGTAATGGAAATTC 

AGATATGCCGATGAAATCATTAAACCTGGGCTTGAAAGGCT^ 

GTAAAAAGCGTGTTTTAATTGAAGTTTCTCAAAACAGGCT 

AATTATAGCATCCCAAAATTTGGAACTTTCAGCTGGCAATATA 

TCTGGAAAATTTAAATCAATTCAAGAGATAGGTAATGTG^ 

TAAATTTATCTCCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAAC^ 

TGTTGAATATAATGGGTTGCCTTCAATTTCTTTGra 

AATGTTGTTATGAATGAAATAGAAAAATTGAAATT^^ 

GTACTGATTTTATTAAAGCATCCATTTCAAC 

TATTTTTTTCTTTTTAAGAAGCTTTAGG 

TTTTGTTTAATGTATTTTGTAAATATTTCTC 

TTGTTGACTGTTCAATTGTOOTAATAGACAATATATACAAATA 

TATTCTCGGAGCTCAGGAGATGATGTTGCCTATO 

CTTATTTTCAAATCAGAACTTGGGGTATATGGAGATTTTO 

GTGTTTCTCTTTTAGTTGCAATTTTTTTC 

AAAGAATATTAAGAATGCTTTTATTAGGAAAATCGATGCCTTTT^ 
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TATATGAATTTATTAAATATAGTTTT^ 

GCTTGCTTTTAGGATTATTGTTAGATGTGACAACTTTTACTAGAG 

TTTTCCCCACAAAACTAATTTGGAATATGCAAAATTTT 

AAAGGATATAAAAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTT^ 

AATCAAGAGATAATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATC 

TTATCCTGAATTTAATATTGAGCCTTCCATTAGTGGCAATGCTTO 

TCGGCCAATGATTTTGAATATATAAAAGATTATGGAAAAATTTT 

TTGTAAATCC7UVGGCTTAGCATAAGTGATTTTC 

TTATGGTATTGACATGAATACCATTrTAAATGAGTTGAAGGCCAATATT^ 
GAGAAGGGACTTAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTT 
TATTTATTACAAATTCATCTGGAGTTAAAATTCCTTTTTCATC 
ATCTATTTACAGAGAAAATCAAGCTTTAACCATTTATC 

GTAACCGCAAAAGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTC 

AATATAATGAATTTTCAAATATCATGAATCAGTTTAAAA 

TATGGCTTCTCAATTTGAATCTTTT^ 

GTGCTTATACATTTTCTTGCAGGAGAAAAGCTTTCTA 

TGGTAAATACAGGAATTGTTCTTGTAGACTATACTGGTTTAT^ 

TATTGAATCTTGTCGTTCAAGGCTTAGGCCAATTTTAATGTC 

GCATTTTCTAGCGGAAGTGGAAATGAACTTCTAAAA 

CATTTCTTACTTTGTTTTTTATTCCCATGCTTTTTC 

t814.nt 

AGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTC^ 

CTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGOT 

TAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATC 

GAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTC^ 

ATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAAT^ 

TGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAA 

GTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTAT^ 

CCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAA 

TAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCT^ 

CCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTC 

ATGGGTTGCCTTCAATTTCTTTGTCGGTTCAA 

GAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAG^ 

ATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCT^ 

TTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTC 

GTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTC 

TCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTT 

CTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTA 

ATCAGAACTTGGGGTATATGGAGATTTTTTC 

TTAGTTGCAATTTTTTTGGTTCCTGTTTT 

AGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTG^ 

ATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTC 

GGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAA 

AAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAG^ 

AAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCA^ 

AATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATC^ 

TTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGA 

TTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATC 

AGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGA 

ACATGAATACCATTTTAAATGAGTTGAAGGCCAATATT 

TAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTO 

AATTCATCTGGAGTTAAAATTCCTTTTTCATCAATAGCCACC 

GAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTC 

AGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTCTTAAGGT^ 
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TTTTCAAATATCATGAATCAGTTTAAAATAAT^ 

AATTTGAATCTTTTTTAAAACCCTTTATO 

TTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTC 

GGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGG^ 

GTCGTTCAAGGCTTAGGCCAATTTTAATGTC 

CGGAAGTGGAAATGAACTTCTAAAACCAATTGCATT^ 

TTGTTTTTTATTCCCATGCTTTTT^ 

f818.aa 

MIJCNHSKLIIQLKVVMMIYLK^ 

GEKHGNGVWPEENFILIIYTSNQSIVERLKDr\n)DLNRSYPTEGINLFVIJWS 
t818.aa 

KKMGNDMTKFYNYRI EIVSNLSLEI^WECIEKIEQELGESIYYSKIGWVYGKGKKGEKHGNGVWPEENFILI IYT 
SNQS I VERLKDIVDDLNRSYPTEGINLFVLRNS 

f818.nt 

ATGTTGAAAAATCATTCAAAATTAATAATTC^^ 

ATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCT 

AGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTC 

GGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTAT^ 

TTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTC 

AAATTCTTAA 

t818.nt 

AAGAAGATGGGGAATGATATGACTAAATTTTATAATTATAGGATTGA 

ATGTTTTTGAATGTATAGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATO 

TGGAAAAGGTAAGAAGGGAGAAAAGCATGGTAATGGCGTT^ 

TCCAATGAGTCTATTGTTGAGCGATTAAA 

ATCTTTTTGTTTTGAGAAATTCTTAA 

£820.aa 

MLNNTYRIKTILTIFLAITLLTIYKYFTLMAFNNSPDOT 

IENIVSTSETLGAILQINSRILKEKLSSNKGFLYIKRKIKREESDLIKRIQAEGRLSNI 
NITGFVGTDNLGLEGIEFSLNSILGKDKTK^ 

NSQNGEILSMVQFPQYDANFYSKYPEEIRKNLSSSLTYEPGSINKIFW 
GBKITIKTLNPPYKHIDSTEILIYSSNVGIAYITEKVSN 

SKATIGFGQEIGVSAVQILQAASILSNNGIMLKPRI IKKISNDKGENIKEFDKEEIRKVI SKNSAQKVLKMMREW 
NKGGIP1ILKIKNLDISAKSGTSQAIDRKTGKYSEEDYTSSILAI YPTEQPKYI IYTVYRYPKKI IYGTRIAAPMAK 
EIIEFIEHQQNTIAYKKIKMPSKIKIPKAETNYKNKTYLPNFINLSKREM 
SPNTKLEDITELELYLK 

t820.aa 

FNNSPDOTISLKSNDIAIOIGTIYDRNGK^ 

FLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTO 

QFLNEEPETNNIHLTIDMDIQQGVSKIAK^ 

LSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPSGEKITIKTLNPPYKHIDSTEILIYSSWGIA 
YITEKVSNEYFYKKLI^FGFGEKVG^FPGETKGLLNHYSKWSGRSKATIG 

LKPRI IKKI SNDKGENIKEFDKEEIRKVI SKNSAQKVUCMMREWNKGGI PNLKIKNU3I SAKSGTSQAIDRKTGK 
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YSEEDYTSS I LAIYPTEQPKYI I YIVYRYPKKI I YGTRIAAPMAKEI IEF I EHQQNTIAYKKI KMPSKIKI PKAET 
NYKNKTYLPNF INLSKREAIDI LKYYKNTMKIKINGDGFVYKQSI S PNTKLEDITELELYLK 

f820.nt 

ATGCTTAATAACACTTATCGAATAAAAACAATATT^ 

ATTTCACACTAATGGCCTTC^TAACAGCCCAGACAACACAATATCTT^ 

AACAATTTATGATAGAAATGGCAAACCAATAGCATTCTCTTC^ 

ATAGAAAATATTGTAAGCACATCTGAAACTCTTGGTGCAATACT^ 

TTTCCTCTAACAAAGGGTTTTTATATATAAAAA 

AGCTGAAGGCAGGCTTTCAAACATCACTTTATATCCTGATTACAC^ 

AATATTACTGGTTTTGTAGGAACAGATAATCTTGGCCTT^ 

AAGATAAAACCAAGCAACAATTTTTAAATGAGGAGCCAGAAACAAACAACATC 

ACAACAAGGTGTTAGCAAAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAG 

AACTCCCAAAATGGAGAAATATTATCCATGGTTCAATTO 

AAGAAATCCGAAAAAACCTTTCTTCATCTOTAACCTATGAGCCCGGAAGC^ 

AATATTAGAAAGTGGAAAATTAAATTTAGAAGAAAAATTTTTAGA^ 

GGAGAAAAAArTACAATCAAAACATOAAATCCCCCCTATAAACATATCGACTCTACAGA 

CAAATGTTGGAATAGCTTACATTACTGAAAAAGTTAG 

TGGGGAAAAAGTTGGAGTTCCATTTCCCGGAGAAACA^^ 

AGTAAAGCTACAATTGGATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATT 

GCAATAATGGAATAATGCTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAAAGGAGAAAAT^ 

TGATAAAGAAGAAATAAGAAAAGTAATATCCAAAAATTCAGCACAAAAAGT^ 

AATAAAGGTGGAATTCCAAATCTTAAAATTAAAAATCTTGACATTTC 

ATAGAAAAACGGGAAAATACTCAGAAGAAGACTATACATCTTC 

ATATATTATTTACATTGTATACAGATACCCAAAAAA 

GAAATAATAGAATTTATTGAGCACCAACAAAATACAATAGCATATAAAAAAATTA 
TCCCTAAAGCTGAAACTAATTACAAAAACAAAACATACTTACCAAATTTTATCAACC 
AGACATACTAAAATACTATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTT 
TCCCCCAATACAAAATTAGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

t820.nt 

TTGAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATG 

ATGGCAAACCAATAGCATTCTCTTCAAAATCCTACT^ 

CACATCTGAAACTCTTGGTGCAATACTTCAAAOT 

TTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAA 
CAAACATCACTTTATATCCTIK^TTACACAAGAATTTATCCC^ 

aggaacagataatcttggccttgagggk:attgaattttccct 

caatttttaaatgaggagccagaaacaaacaacattc 

aaatagctaaaaaatactttaaagaaaataatcctgaaatf 

aatattatccatggttcaatttcctcaat^ 

ctttcttcatctctaacctatgagcccggaagcattaata 

aattaaatttagaagaaaaatttttagacaatggaatatatc 

caaaacattaaatcccccctataaacatatcgactc^ 

tacattactgaaaaagttagcaatgaatactttt^ 

TTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATO 

ATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGO 

CTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAAAGGAGAAAATATTAAAGAATO 

GAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAAl^ 

AAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAAC^ 

TACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACC 

TATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAC^ 

TGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATGCCATCAAAAATC 

AATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCT 
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ATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATOT 
AGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

f831.aa 

MAKNNLLVFFIAI IFVFVSI IWFYNSLGKDYVKSGGEIVENLEKDLNDYLKEN^ 

SSYF I SRFYLARAVYFQSQAQYDEAI KDLDIVI KAKGIESE IAFLNKAAVYEKMGLKEDALLWEDLINSTSLDFL 
KVRALLSKAILIEEKDKEIAVKVYEEIVKFPYENNLYINMANNKILELKQN 

t831.aa 

YNSIX5KDYVKSGGEIVENLEKDLNDYIjKEtn>AKEREKI FLRIRELI SKEKEISSYF I SRFYLARAVYFQSQAQYDE 

AIKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLVYEDLINSTS^ 

EEIVKFPYENNLYINMANNKILELKQN 

f831.nt 

ATGGCTAAAAATAATCTTTTAGTTTTCTTTATTGCTATTATO 

ATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTC 

AAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAG^^ 

tcatcttattttatttcaaggttctatttagccagag^ 

ttaaagatttagatattgttattaaggcaaaaggtattgaaagtc 

tgaaaaaatgggattaaaagaagatgctttgttagtttatgaagatc 

aaggtaagagctcttttgagtaaggcaatattgatt^ 

agattgttaagtttccgtatgaaaataatttatatataaatatggcaa^ 

TTAA 
t831.nt 

TATAATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATC 

ATTTAAAAGAAAATGATGCGAAAGAGAGAGAAAAAAT^ 

AATTTCATCTTATTTTATTTCAAGGTTCTATTO 

GCTATTAAAGATTTAGATATTGTTATTAAGGCAAAA 

TTTATGAAAAAATGGGATTAAAAGAAGATGCTTO 

TTTAAAGGTAAGAGCTCTTTTGAGTAAGGCAATATT^ 

GAAGAGATTGTTAAGTTTCCGTATGAAAATAATTTAT^^ 

AAAATTAA 

£843. aa 

MKAIGNAI LLNMPLI F S I G I S IGVARMGQGTAALGGLIGYLTFNITENYF I EAFSGLVEAETMS SVGRINFFGVQT 
LNTGIAGSLAVGLLVGYLHNKFYNMKLPKP 

FEYFGSFLYGFLNRLLLPI^LHS ILSFPFEFTSLGGVEIVNGDTVRGLKNI FYAQLLDPSLGKFSSGFAKI SSGFY 

LSIMFGLPGAALCVYKGIVHEDKNKVAALLFSGA 

WIGNSFSTGFIJDFFMFGILQGNSKTNWISVLPIX3AMFFALYYFTFS 

IAHLLIQGLGGFDNITKLDVCSTRLHVDVVNTELVDNNLL 

SLFEASVMVAVDNVKKGFKTYIEMKEDKKLEKQGKSGKTYKI*SELEED 

t843.aa 

RMGQGTAALGGL I GYLTFN I TENYFI EAFSGLVEAETMS SVGRINFFGVQTLNTG I AGS LAVGLLVGYLHNKFYNM 
KLPKPFWFSECHFVPrVTILPFCWIAIFFCLIWSSFDDLIASIXJLFVFI^ 

SFPFEFTSLGGVEIVNGDTVRGLKNI FYAQLLDPSLGKFSSGFAKI SSGFYLS IMFGLPGAALGVYKGIVHEDKNK 

VAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFVHAAYSGFAIJJ^ANFFNVTI 

TNWISVLPIXSAMFFALYYFTFSWLYRYFDFQIFVTDDPFFEGQEGKLESL^ 
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HVDVVOTELVDNNLLKEAGVLKIGLW 
EDKKLEKQGKSGKTYKLS ELEED 

f843.nt 

ATGAAGGCTATAGGCAATGCTATTCTTCTCAATAT^ 

GAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTO 

TGAGGCTTTTTCAGGGCTTGTTG7VAGCAGAGACAA 

TTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTT^ 

AGCTACCCAAACCTTTTGTGTTT^ 

TTTGGCTATATTTTTTTGTTTGATT^ 

TTTGAATATTTTGGCAGTTTTCTTTATGG^ 

CTTTTCCTTTTGAGTTTACTTCTTTGGGAGG 

ATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATC 

CTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTT 

TTGCAGCACTTCTTTTCTCTGGGGCCTTGACA 

TTTCACAGCGCCTTTGCTTTATTTTGTTC 

GTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTO 

CA7VATTGGATTAGTGTATTACCTTTGGGGGCAATGTTOT 

ATACTTTGATTTTCAGATATTTGTT^ 

ATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTO 

ATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTT 

TAATGGGAAGGTTCAGCTTTTTTATG 

AGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAA 

AAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCT^ 

t843.nt 

AGAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACA 

TTGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTC 

TTTAAATACGGGAATTGCAGGTTCTTTA 

AAGCTACCCAAACCTTTTGTGTTTTTT^ 

TTTTGGCTATATTTTTTTGTTTGATTTGG 

GTTTGAATATTTTGGCAGTTTTCTTTATGGATT^ 

TCTTTTCCTTTTGAGTTTACTTCTTTGGG 

TATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTT^ 

TCTATCTATTATGTTTGGACTGCCCGG^ 

GTTGCAGCACTTCTTTTCTCTGGGGCCTTGAC^ 

TTTTCA(^GCGCCTTTGCTTTATTTTC 

TGTTA(XATTGGCAATAGCTTTTCTACTGG 

ACAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGC 

GATACTTTGATTTTCAGATATTTGTTACAGACGATCCAT^ 

AATTGCGCATCTTTTAATTCAAGGTCTTGGTGGA 

CATGTAGATGTTGTTAATACTGAGCTTGrTCATAATAATTTGCT^ 

TTAATGGCAAGGTTCAGCTTTTTTATGGATC 

GAGTCTTTTTGAAGCTAGTGTTATGGTTGC^GTTGA 

GAAGAOVAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGC 

f850.aa 

MRFKKIFLIIFIISNIJCVYSYNYAIQYKNEGIDKYYFEILND^ 

HMLTYRGYKDSPKSLI SRTDLI EIGFMYYFPI LLLINGKNFGEIDLGIGVKNLLFGDWGGHLMQS I IHLI LNQHRP 

IPSIKSYDSYNYRGFLSFALNYSYMNFLNLENYMDLSYFA^ 

YSKTQEAETGIGINYQFTSKNFFITNNLNIKNFSTKENFLSV^ 

IMIPLKIRNSLFYKINENINHYFSISTNYYTNYNETNSFTNQLSSGIMYEFL 

KSISRPIRIKNILQVGIENELGFLFKMLKYRNTE^ 
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t850.aa 

YSYNYAIQYKNEGIDKYYFEILNIXSFGFSLSD^ 
TDLIEIGFMYYFPILLLINGKNFGEIDU3IGVKN^ 
ALNYSYMNFLNLENYMDLSYFADYFIKNSIGITLK^ 
SKNFFITNNLNIKNFSTKENFLSVGGFGIIITPEEYKKISES^ 

INHYFSI STNYYTNYNETNSFTNQLS SGIMYEFLPQKTFNPYLI SGLFFAYNQNNKDIKS I SRPIRI KNI LQVGI E 
NELGFLFKMLKYRNTEYIFKI YSKVNYI PIAYNLDEKKLEKHSINFNYLGIGIVVK 

f850.nt 

ATGCGGTTTAAAAAAATATTTTTAATAATATTTATAATTO 

AATATAAAAATGAAGGTA1TGACAAATATTATTTTGAAATACTAAATC 

TTTTGATGACTTGAGAAGTGGTTCTCTTATTTOT 

CACATGTTAACCTATAGGGGTTATAAAGACTCTC 

TCATGTACTATTTTCCAATTTTATTGCTAATTAATGG 

AAACITATTATTTGGAGACTGGGGAGGGCAT^ 

ATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGAT^ 

ATTTTTTAAATTTAGAAAATTATATGGAC 

CTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTA 

TATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAATTOT 

ATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAA 

TGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTAT 

ATTATGATCCCATTAAAAATAAGAAATTCATTATTTTATAAAATAAATGAAA 

CAACAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCA^ 

TTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATO 

AAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTO 

AAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTAACTATA 
CTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTO 

t850.nt 

TATTCTTATAATTATGCAATCCAATATAAAAA 
TCGGATTTTGATTAAGCGATTTT^ 

TTTTATAATAAATTTAGAAGCACACATGTTAACXrrATAGGGG 

ACAGACTTAATTGAAATAGGCTTCATGTACTATTTTC 

TAGACTTGGGAATTGGAGTTAAAAACTTATT^ 

GATTTTAAATCAACACCGTCCAATTCCAAGTATAAAAAGCT 

GCTCTAAATTACTCTTACATGAATTTTTTAAATTTA^ 

TTAAAAACAGTATTGGAATTACCTTAAAAAATC 

TGJUW^TCAAAAGCCTCAAAACATATTCA 

TCTAAAAATTTTTTCATAACCAATAATTTAAACAOT 

GATTTGGAATAATCATTACACCTGAAGAATACAAA 

TAATTTTTACTTTGGATTTGATATTATGATCCCATO 

ATCAACGATTACTTTTCAATATCAACAAATTAT^ 

CATCAGGCATCATGTATGAATTTTTAC 

TAATCAAAACAATAAAGATATCAAAAGCATCTCAAGACCAATAAGAATAAAAA^ 
AATGAATTAGGATTTTTGTTCAAAATGCTAAAATACCGCAACA 
ACTATATTCCTATAGCTTATAACTTAGATGAAAAAAAATTAGAAAAACA 
TGGAATAGTCGTTAAATAA 

f853.aa 

MKSFLFWVILGTVGI SSFAQNTPVAI INLYKNEI ITKTGFDSKVDI FKKTQGRDLTDAEKKQVLQVLIADVLFSQE 
ASKQGIKI SDDEVMQTIRTQFGLVNFTDEQIKQMI EKQGTNWGELLSSMKRSLSSQKLVLKQAQPKFSEI KTPSEK 
EIVEYYEANKTKFVNPDISRVSHIFFSTKDKKRSDVLIX2AKNILSQIRSKKITFEEAVRKY 
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LSRGIXJNAQNLLGADFVKEVTOT 
VQQQQIWQVQQDMYGKLNKSANIQILDSSLK 

t853.aa 

QNTPVAIINLYKNEIITKTGFDSKVDIFKKTQGRDLTDAEKKQVLQVLIADVL 

QFGLVNFTDEQIKQMIEKQGT1WGELLSSMKRS LSSQKLVLK^ S 

RVSHIFFSTKDKKRSDVLDQAKNILSQIRSKKITFEEAVRKYSNDESSKAK^ 

VFNFNKGDISSPIASKEGFHIVKVTEKyAQRFI^LNDKVSPTADLIVKD 

KSANIQILDSSLK 

f853.nt 

ATGAAGAGTTTTTTATTTTGGGTAATAT^ 

TTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGT^ 

AGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTC^^ 

GCTTCAAAGCAAGGAATTAAAATCn^AGATGATGAGGTTATGCA 

TTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACAAATTGGGG 

ATCTCTGTCTTCTO\AAAGCTTGTTTTAAAGC^ 

GAAATTGTTGAGTATTATGAGGCTAATAAAACTAA 

TTTCTACTAAAGATAAAAAAAGATCAGATGTTTTAGATCAAGCAAAA 

AATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATC 

TTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTG 

GTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATA 

TTTAGGTTTGAATGATAAAGTGTCTCCTACTGCA 

GTTCAACAACAGCAAATTGTTGTTCAAGTAC^ 

TCTTGGATTCTAGTCTAAAATAA 

t853.nt 

CAAAATACTCCTGTTGCTATTATTAATTO 

ATATATTTAAAAAGACCCAAGGTAGAGACTTAACTGATGCTGAGAAAAAGCAA 
TGTTCTTTTTAGTCAAGAGGCTTCAAAGCAAGG 
CAATTTGGGCTTGTGAATTTTACTGATGAACAAAT^ 
TTTTGTCTTCAATGAAAAGATCTCTGTC 

TAAAACTCCTAGTGAGAAAGAAATTCTTGAGTATTATGAGGCT 

AGAGTTAGTCATATCTTTTTTTCTACTAAAGATAAAAAAA 

GCCAAATAAGATCAAAAAAAATTACTTTTGAAGAAG^ 

AAATGGTGATCTTGGGTTTTTATCAAGAGGTG^ 

GTTTTTAATTTTAATAAGGGTGATATATCTTCGCCTATTC 

AAAAATATGCTGAGAGATTTTTAGGTTTG 

AAGAAATAACATGATTAATGTTCAACAACAGCAAATTff 

AAGTCTGCAAATATACAAATCTTGGATTCTAGTCTAAAATAA 

f859.aa 

MKLPKLYKLILLFLFTTRLFSVKDEKSDNKLEL^ 
IYIQKSKKINYPNRNLGNNINQKTANDVNF^ 
III KMLKTKG YTL I EYI EDNN 

t859.aa 

VKDEKSDNK^ELFSNVETKIKKNSKN^ 
QKTA 



f859.nt 
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ATGAAATTACCAAAACTTTACAAATTAATACTACTCT^ 
AATCAGACAATAAATTGGAATTATTTTCAAATO^ 
TTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTO 
ATATACATACAAAAATCAAAAAAAATTAATTACCCG^ 

ATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATA^ 
AAATX^TAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCG 

ATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAATTAA 
t859.nt 

GTAAAAGATGAAAAATCAGACAATAAATTGGAATTATTTTCAAAC^^ 
ATTACGACTCAAATTCAAACAGCAAAAAGATCAAAAAAGAATCAA 

TATAAATTCCAATATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTA 
CAAAAAACTGCAAATGATGTAAATTTTACAAAAACTAGT^ 

TTCAAGAAATTAAAAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTA^ 

AGATAATCTAGGAATAATAATTAAAATGCTAAAAACAAAGGGATACACT 

TAA 

£861. aa 

MKNFKEVI 1 1 FDSGIGGLS YFKYIKSRIGGCQYVYVADNKNFPYGEKS PEYLLEAVLFLI EKLKKI YNIGALVLAC 
NTISVSVYNKIJIFVFPVVYTLPDVSSVSDLVL^ 

YKKYAI^CLEALKFEVVNTGREIWLGCTHYLHLKVMIEDFLKI PVYENREL WKNL I RSMNF S EHKGNYYKNDFD 
FWDEFYLTENKNLTFYQNFCKKYNLRFKGMIV 

t861.aa 

RIGGCQYVYVADNKNFPYGEKS PEYLLEAVLFL IEKLKKI YNIGALVLACNT I SVSVYNKLNFVFPWYTL PDVS S 
VSDLVLKRVLLIATNTTLESKFVKDQ^ 

GCTHYLHLKVMIEDFLKI PVYENRELVVKNLIRSMNFSEHKGNYYKNDFDFVDDEFYLTENKNL 
RFKGMIV 

f861.nt 

ATGAAAAATTTCAAAGAAGTAATAATTATTTTTGATT^ 

GTAGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAA 

TCTTCTAGAAGCAGTTTTGTTTTTGATTC 

AATACAATTTOTGTTAGTGTATACAATAAATTAAATTTTG 

CAGTTTCAGATCTTGTTTTAAAAAGAGTTCTT^ 

TCATVGTAAATATACATAATGATTTGATTGTAAAAGCTGC^ 

TACAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTC 

TTGGATGCACGCATTATTTGCATCTTAAGGTAATGATA 

ATTAGTGGTAAAAAATCTTATTAGATCAATG 

TTTGTAGATGATGAGTTTTATTTGACCGAAA^ 

TTCGCTTTAAGGGAATGATAGTTTGA 

t861.nt 

AGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTC 

TTCTAGAAGCAGTTTTGTTTTTGATTC 

TACAATTTCTGTTAGTGTATACAATAAATTAAAT^ 

GTTTCAGATCTTGTTTTAAAAAGAGTTC 

AAGTAAATATACATAATGATTTGATTGTAAAAGCTC^ 

CAAAAAATATGCTCTTAGATGTTTAGAAGCTTT^ 

GGATGCACGCATTATTTGCATCTTAAGGTAATC^ 
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TAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACA 

TGTAGATGATGAGTTTTATTTGACC 

CGCTTTAAGGGAATGATAGTTTGA 

f363.aa 

MIRLKVLILCLFGIFVLNGFADTNFEFTOFGGGVAFPV^ 

LANIAKAGIRYGTYAQFGAKFDDFVSIGFELLFNINLLKAIKRSTC 

TGPKINIATSSADSVIJ^IX;TMGM>IGARLSFSF 

t363.aa 

DTNFEFNFGGGVAFPVSPFS SFYNEALEINAKLKQNLPSDLS PI EKEEIVQNFSDLANIAKAGIRYGTYAQFGAKF 

DDFVSIGFEa,LFNINIJLKAIKRSDGT^^ 

MGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV 

f363.nt 

ATGATTAGGCTTAAAGTTTTAATTTTGTGTTTATTTG^ 

AATTCAATTTTGGTGGTGGGGTTGCTTTTCC 

TGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCX: 

TTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTC 

CTATTGGATTTGAGCTTTTGTTTAACATTAATCT^ 

TTTCTCGTTTATTATGGCAATAACACCAAG^ 

ACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTC 

TTGGTGCTAGACTTTCATTTTCTTTTTTA^ 

TGATTTCAAGTTTGGAATAGGTTTTGAATTTG 

GAATTGTGTAG 

t363.nt 

GATACTAATTTTGAATTCAATTTTGGTGGTGGGGTTC 

CTTTAGAGATTAATGCAAAGCTTAAGCAAAAT^ 

AAATTTTTCCGATTTAGCCAATATTGCTAAA 

GATGATTTTGTTTCTATTGGATTTGAGCTTTTC 

CTGCAAATGAAAATTTCTCGTTTATTAT^ 

TTTAGCGTTTTTCACAGGTCCTAAGATCAATATAGCGACT^ 

ATGGGCTGGGATATTGGTGCTAGACTTTCATTTTC 

ACCCTAAATTTTCTGATTTCAAGTTTGGAATAC^ 

f368.aa 

MIDLTQEKQEI LI KNKFLAKVFGLMS IGLLI SAVFAYATSENQTIKAI I FSNSMSFMAMI LIQFGLVYAI SGALNK 

ISSNTATALFLLYSALTGVTLSSIFMIYTQGSrVFTFGITAGTFLGMSVYGY^ 

LVNMFFRSSGLNFLISILGVVIFTGLTAYOTQNISKMDKMLQDOT 

RRND 

t368.aa 

TSENQTIKAIIFSNSMSFMAMILIQFGLVYAISGALNK^ 

ITAGTFLGMSVYGYTTTTDLTKMGS YL IMGLWGI 1 I AS LVNMFFRS SGLNFL I S I LGWI FTGLTAYDVQNI SKMD 
KMLQDOTEIKNRMAVVASLKLYLDFINLFLYLLRFLGQRRND 



f368.nt 
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ATGATCGATTTAACACAAGAAAAACAAGAAATACTAAT^ 

CAATTGGACTTTTAATCTCAG^ 

AAATTCAATGTCATTTATGGCTATGATACT^^ 

ATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTACTCAGCACTAACAG 
TGATTTACACACAAGGATCAATAGTATTCACATTC 

ATACACTACT^CAACAGATCTAACAAAAATGGGAAGCTATTTAATAATGG 

CTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTTC 

TAAGAGCTTATGATGTTCAAAATATT^ 

GGCGGTTGTAGCCTCACTTAAACTTTATTT^ 

AGAAGAAACGATTAA 

t368.nt 

ACOTCAGAAAATCAAACAATCAAAGCAATAATATTC 

TTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAAATATCAA^ 

CTCAGCACTAACAGGAGTAACATTATCTTCTATATTTATGATTTACACACAA^ 

ATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATGGATACACT 

ATTTAATAATGGGCTTATGGGGAATCATTATTGCA 

CCTTATATCTATTTTGGGCGTAGTT^^ 

AAAATGCTACAAGACGACACTGAAATAAAAAACAGAATGGCGGTTC 
TAAATTTATTCTTATATCTTCTAAGATTTTTGG 

f371.aa 

MKFFFLLQIALILLSNSSLLFGQSPPKEKEDSLLLYKEGKFKEAILNTL^ 

RAEKEAI IGLGI KKHDIRI I QALGEAYF FQKNYDNALK YFQEY I SLDSKGARI IKVYNLIADSFYELKRYNEADFA 
YEHALRFSPNNQNLLIKLARSRINAKNKILAEEALIKILTISPNNLEAKNLLEE^ 

t371.aa 

EDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIAIGEYKR^ 
QKNYDNALKYFQEYI SLDSKGARI IKVYNLIADSFYELKRYNEADFA^^ 
LAEEALIKILTISPNNLEAKNLLEELKKSNNKP 

f371.nt 

ATGAAATTTTTTTTTCTATTACAAATA 

CTAAAGAAAAAGAAGACrTCTCTTCTTCTATATAAAGAAGGAA 

AATTCGACTAAATCCTAGTAACTTAGATGCTAGGACAATATTG^ 

AGAGCTGAAAAAGAGGCGATTATAGGACTTGGCATTAAAAAAC 

CTTATTTCTTTCAAAAAAATTATGACAATGCAT^^ 

AAGAATAATAAAAGTTTATAATTTAATTGCAGATTC 

TACGAACATGCATTACGTTTTTCTCCTAATAACCAAAATCTATO 

AAAATAAAATATTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTC 

ACTAGAAGAATTAAAAAAAAGCAACAACAAACCTTGA 

t371.nt 

GAAGACTCTCTTCTTCTATATAAAGAAGGAAAATTTAAAGAA 

ATCCTAGTAACTTAGATGCTAGGACAATATTC 

AGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATO 

CAAAAAAATTATGACAATGCATTAAAATACTTTC 

AAGTTTATAATTTAATTGCAGATTCTTTTO 

ATTACGTTTTTCTCCTAATAACCAAAATCTATTAATAAAATTAGCA^ 

TTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTCCTAATAATC 

TAAAAAAAAGCAACAACAAACCTTGA 
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f502.aa 

MKKANFLSTNFLILLLVCFVNVNL^ 

LNESEIEDEAIYLGLTYNVKLNDFFYFKSELARSIS 

VNTIFLADNSQELVliALKNDKVDYIYGIX^ 

FSYLMKMPEELWSFLDSNAKGSFVDVGLYMD^ 

SLIXSKS^IFGGIISNDSVLENVN^ 

IVNNSYGFIENSITTKYLLKLNGYNGRL^^ 

LDKEEINSSRINSYKIMNKVKFNIEEKIWLSKNNKL^^ 

KVHSSLDIJCKLIKSGKIDMLNTNATDSl^^ 

KSKLILVSSFNEALIJ^YKGKVIXSIISDEYT^ 

MRSNVDSQMYLNDWKFDI YYKSRS IRFKNFKFLVITFI IFYFTFLGFVI I FMFRLSFEQKRRYSFVMNEKKI AEAA 

NAAKTIFJJ^SHDIRTPINGIMAATELLI^ 

IDLESEMEMVLKAFQSQCAKKNIDLFSYSKSIFN^ 

IXSNRVLVTVEFKVIDTGKGIEKEireSKIFEIF 

FSFMLPFLLGSELKSKKLSINRFQSWGDNKVLNV^ 

YNFVYINVNNDNIQEGIRIANNIERI^SDVQ 

EDLSQKIDSALRIKEPINVIjI AEDNQVNQKVLKDI LWIGINENFIDWDDGVKALKSLKDKKYTI SFIDIRMPRYD 
GF SVAKE IRKFEKAKNLK PCVLVAVTAHALQEYKDKCLASGMNDYI SKPIHI SSIKTILKKYLQFEVDDIGENENL 
NQLVKFPNLDVNRAI^IiNLSYVSYSEI/:RGLVDFISINIIDLEKAFD 
FQKIOTSKDSISELKKMYSFVKDDLFQLISDIKENILFESEIV^ 
EILESINKYVHjDDNIQVLFSDLRRNLRL^ 

t502.aa 

CFVNVNLFSKDIFKFKLVDQFFPFYYKNNKGEYEGLIFSILDKWAKDN^ 

NVKLNDFFYFKSELARSISILFFKNSNKKYKNra 

KNDKVDYIYGIXrKTIiHYIANNFLSEDLVIFTGDVFYSIKNRV 

SNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDL^^ 

SVLENVNYVVSKPIYPLNFKFYSKDLSNDAGPINSQ 

LLIOiNGYNGRLKSYDSIFNKNRFLVIiAIDNRIYKVIKY^ 

NKVKFNIEEKIWLSKNNKLNLAVKNW^ IKVHSSLDLKKLIKSGKI 
DMLNTNATDSNI^NVPNIKIjNSRIPLYIFSNKKRVLPSRS 

YKGKVDGI I SDEYTAAAVFEELNIDDVEKI PTFRDLAFDLSLAI YNQDYI LKEI IQKWMRSNVDSQMYLNDWKFD 

IYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRLSFEQKRRYSFVMNEK^ 

PINGIMAATELLDTTILTDV^KDYVRMINYSSDSLI^LIDD 

CAiaOJIDI^SYSKSIFNNYIKGDIVKIKQVLINLI^^ 

KGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRL^ 

LSINRFQSVNGDNKVLNVLLSQKSIKIFE 

RLANNIERLNSDVQI I FLFYYLDNIUVLKNLKYGYVKKPLMGLGICS ILYKKEFNPEMDFEDLVPIDSALRIKEPIN 

VLIAEDNQVNQKVLKDILWIGINENFIDVVDIX5VKAI^SIJC^ 

KPCVLVAOTAHALQEYKDKCLASGMNDYI^ 

LNLSYVSYSELCRGLVDFISINIIDLEKAFD^ 

YSFVKDDLFQLI SDIKENI LFESEI VSENKLYFKNNDQFLNLLNKLLIGIKTRKPREYKEI LES INKYVLDDNIQV 
LFSDLRRNLRLYRFAESSKILEEI I EMLNNKRY 

f502.nt 

ATGAAAAAAGCAAACTTTTTAAGTACTAATTTTT^ 

CTAAGGATATTTTCAAGTTTAAGCTTGTA^ 

AGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGAT^ 

TTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACT^ 

ATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTTAT^ 

TTCAAGATTTTTATCCAATTTTAATATAGC^ 

GTTAACACCATTTTTTTGGCTGATAATTCTCAAGAGTTA^ 
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ATGGTGATTGCAAGACTTTACATTATATTGCA 

TTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGAT 

TTTTGATATTTAATGAAAATGCCTGAGGAACTO 

ATGTTGGTTTATATAATGATTATCCTCCTTO 

TTTGTGGAATCITCTCTCAAGACAACATATCTTT 

TCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTA 

TAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTC 

TTCTCAGTTTATTGATTTTAATTTT^ 

ATTGTTAATAATTCATATGGGTTTATAGAAAATT^ 

GTAGATTAAAATCTTACX^TTCGATTTTTAATAAAAATAGGT^ 

GGTTATTAAATATATTCTCAATTCTA 

TTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATGAATAAGGTTA 

AAAAAATTTGGTTATCAAAAAATAATAAATTAAATC 

AAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTA 

AAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATCGATAT^ 

ATTCAAATTTAGATAATGTTTTCAACAT^ 

GGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACnTC 

AAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAG^ 

TTAGCGATGAGTATACAGCTGCroCTGTTTTTGAGGAATTAAAT^ 

AGATTTGGCTTTTGATTTGAGTCTTGCTA 

ATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGAT^ 

GGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATO 

TATGTTCAGATTATCATTTGAGCAGAAAAGAAGATATTC^^ 

AATGCTGCTAAAACCATTTTTATAGCC^ 

CTGAGCTTTTGGATACAACTATTCTTACAGATGTTC 

TTTGCTTTCTTTAATTGATGATATATTGTATTTC 

ATTGATTTAGAGAGTGAAATGGAAATGGTTTTAAAAGCTT^ 

TCTCTTATTCTAAATCTATTTTTAATAATTATATAAAGGG 

AATAGGAAATGCTTTTAAGTTTACAGATGATGGTGTTATTC 

GATGGTAATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGA 

CTAAGATATTTGAAATATTTAAACAAGAGGATGATTC 

AATATCTAGAGAGCTTATAAGACTAATGGGTGGTCT^^ 

TTTTCATTTATGTTGCCCTTTTTATTGG^ 

TAAATGGTGACAATAAAGTATTAAATGTGCTTTTAAGTCAAAAATC 

ATTGGGATGCTCTTCTAATGTGCGCTATGTAGCGTCTTTTGA^ 

TATAATTTTGTTTATATAAATGTAAATAACGATAATATTCAA 

TAAATTCTGATGTACAAATTATTTTTTTATTTTA 

TGTTAAAAAGCCTTTAATGGGGCTTGGTATATGCTCTATT^ 

GAGGATTTGGTTCCAATAGATAGTGCTTTAAGGATAAAAGAGCCCAOT 

TAAATCAAAAAGTGTTGAAAGATATTCTTGTTGTTATAGGCATTAATC 

AGTAAAGGCTTTAAAATCTTTAAAAGATAAAAAATATACT 

GGATTTTCGGTGGCTAAGGAAATTAGAAAATTTGAAAAGGCAAAG^ 

CAGCGCATGCTTTGCAAGAGTATAAAGACT^GTC 

CATAAGTTCAATTAAAACTATATTAAAAAAATACT^^ 

AATCAACTTGTTAAGTTTCCTAATTTAGATGTTAATAa 

CTGAATTATGTAGAGGGCTTGTTGATTTTATCTCTATTAATAT^ 

TTTGTCTTTAATTAAGGATATATCTCATTCAATATC 

TTTCAAAAAATTGAAAGAAGTAAAGATTGAATO 

TTCAACTAATAAGCGACATAAAGGAAAATATTTTGT^ 

AAATAATGATCAATTTTTAAACCTTCTCAACAAAC^ 

GAAATTCTTGAGAGCATTAATAAATATGTTTTAGACGATAATATTC 

TAAGArTATATAGATTTGCTGAGAGCTCTAAGATTClTCAAGAGATT 

G 



t502.nt 
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TGCTTTGTCAACGTCAATTTATTTTCT 
ACAAGAATAATAAAGGAGAAT^ 

TATTATGGTTGAGCATATTGATAATTTAAATGAAAGTGAAATT 

AATGTAAAATTAAATGATTTTTTTTATTTTAA^ 

OTAATAAAAAATATAAAAATACCCATTCAACATTTTTATCCAAT^ 

TGAAGATATCTTAAGGTTAAAAAACGTTAACACCATTTTTTT^ 

AAAAACGATAAAGTTGATTATATATATGGTGATTGCA^ 

ATCTTGTGATTTTTACCGGGGATGTTTTTTATAGT^ 

AGTAAAGAATTTGAATTTAGATTTGTTTTCATATOT 

AGCAATGCTAAGGGAAGTTTTGTTGATGTTGGTTTATATAATC 

GAAAATTGTCTGGCATTTTAGTGGATTTGTGGAATCTT^ 

ATTTTCCAAAGAGGATATTAAGAAATCATTAGATGGAAAATC^ 

AGTGTGTTGGAAAATGTTAATTATGTAGTAAGTAAGC 

TAAGCAATGATGCTGGTCCAATAAATTCTCAGTTTATTGAT^ 

AGATATTGTTAATAACTTTATAGATATTGTTAATAATTCATATG 

TTGTTAAAATTAAATGGATATAACGGTAGATTAAAATCTTACGATTCG^ 

TAGCCATTGATAATAGGATTTATAAGGTTATTAAATA 

TTTGCTTCAAATAGATAAAAATTGGTTGGATAAAGAAGAGAT^ 

AATAAGGTTAAATTTAATATAGAAGAAAAAATTTGGTTATCAAAAAATA^ 

GGTATCCAATAGATTATGTTGAGGCAAATAATTATAAAGGAATAAA 

TTCAGGTTTGAGATTTAACATAATTAAAGTACACAGCAGTT^ 

GATATGCTAAATACTAATGCAACCGATTCAAATTTAGATAA 

TTTATATTTTTTGAAATAAGAAAAGGGTGCTTCG^ 

TAGTAAAAATTTGGCTTCTAATATTAAATCAAAGC 

TATAAGGGAAAGGTAGATGGGATTATTAGCGATGAGTATA^ 

ATGTTGAAAAAATTCCTACTTTTAGAGATTTGGCTT^ 

GAAAGAAATTATTCAAAAAGTTGTTATGCGTTCAAATGTTGACAGTC 

ATTTATTATAAATCCAGAAGTATCAGGTTTAAAAATTTCAAAT^ 

CTTTTTTAGGATTTGTAATTATATTTA 

TGAAAAAAAGATTGCGGAAGCCGCTAATGCTGCTAAAACCAT^ 

CCTATTAACGGAATAATGGCGGCTACTGAGCITTTGGATAC 

GGATGATAAATTATTCATCTGATTCTTTGCTTTCTTTAAT^ 

TGAATTATATGTTGAGAGTCAAGAGATTGATTTAGAGAGTGAAATGGA 

TGTGCAAAGAAAAATATTGATTTATTCTCTTATTC 

AAATTAAACAAGTTTTAATTAATTTAATAGGAAATGCTTT^ 

TGAAGAAGTATGTAGAACAAGAACTGATGGTAATAGGGTTTTTC 

AAAGGGATTGAAAAAGAAAATTTTTCTAAGATATT^ 

ATGAAGGTGGAGGATTGGGATTGTCAATATCT^ 

TAGCAAGGTGGGAGAGGGTACAACTTTTTCATTTATGTTG^ 

TTGTCAATGAATAGATTTCAATCAGTAAATGGTGACAA^ 

AAATTTTTGAGCACTGTTCGATTTTATTGGGAT^ 

TAAAGTCTTCAAGAAATACCCTTCTTATAATT^ 

CGACTTGCCAATAATATTGAAAGACTAAATTCTGATC 

CTCTAAAAAATTTAAAATATGGTTATGTTAAAAAGCCTTT 

AGAGTTTAACCCAGAAATGGATTTTGAGGATTTGGTTC 

GTTTTAATAGCTGAAGATAATCAGGTAAATCAAAAAGTC 

ATTTTATTGATGTTGTAGATGATGGAGTAAAGGCTTTAAAATC 

TGATATACGAATGCCAAGATATGATGGATTTTCGGTGGCTAAGGAAATO 

AAGCCTTGTGTTTTGGTTGCTGTAACAGCGC 

ATGATTATATCTCAAAACCAATACACATAAGTTCA 

TGATATTGGGGAGAATGAAAATTTGAATCAACTTGTTAAGTTTC 

TTAAATCTTTCATATGTATCATATTCTGAATTATC 

TGGAAAAAGCTTTTGATGAGGAAGATTTGTCTTTAATO 

TATGCGTAGCGAATTGTATAAAGATTTTCAAAAAATTGAAACAAG 

TATTOTTTTGTAAAAGATGATTTATTTCAACTAA 

TTAGTGAGAACAAGCTATATTTTAAAAATAATGATCAATTTTTAAA 
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GACTAGAAAGCCAAGAGAATACAAAGAAATTCTTG^ 

TTATTTAGTGATCTTCGCAGAAATTTAAGAOT 

AAATGCTTAATAATAAGAGATATTAG 

f527.aa 

MNLLVKIAKFILILFLFTSCNQKQSEIQNL^ 
SREYYPLYLYLMGNIYDSMGEDFVAFm 

DNLNNEEKGNYFYNLALSLEDVQDYDESYFYYKKFLSIPRAHLKIDSra 

DVKNFVLSGNTSKLLNIRDKNNFFIQSWIX2KGGKSNSINTO 

SRGWDHIHEWYFVFKRIVYPKDPEINNGWTWIGVYLGKK 

t527.m 

CNQKQSEIQNLTHLLKSSNKNRLDKFLI IDRWNI YIANKNYEDALEIVNNGI IDDESREYYPLYLYLMGNI YDSM 
GEDFVAFNIYKRVVDNFDDYVYENHSMKTRVAKKIVNLNIDSIDKINYYKFIL^ 

EDVQDYDESYFYYKKFLS I PRAHIJCIDSRDYFNVVTKINYFNNPEFVVYRNLGDLI QDVKNFVLSGNTSKLLNI RD 
KNNFFIQSWDQKGGKSNS INTNSFLTTMIRLGGRRKNGIQFAKHLEADSSDDIS YLESRGWDHI HEWYFVFKRIVY 
PKDPEINNGWTWIGVYLGKK 

f527.nt 

ATGAATCTATTGGTCAAAATTGCTAAATTTATTTTGAT^ 

AGATTCAAAATCTTACACATCTTTTAAAATCTT^ 

TGTTAACATATATATTGCAAATAAAAATTATGAAGATGCTTTAG 

TCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAAT^ 

TTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATT 

TAAAAAGATTGTCAATTTAAATATTGATTC 

GATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGC 

ATGAATCTTATTTTTATTATAAAAAATTTCTTT^ 

TAATGTTGTTACAAAAATTAATTACTTTAATAATCCAGAGTTTGT^ 

GATGTTAAAAATTTTGTTCTTTCTGGTAATACT^ 

AAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAA 

GGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTT^ 

TCAAGGGGCTGGGACCATATTCATGAATGGTATTT^ 

ATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

t527.nt 

TGC^ACCAAAAGOVAAGCGAGATTCAAAATC^ 

TTCTTATTATTGATAGAGTTGTTAACATATATATTGCAAA 

TGGAATTATTGATGATGAATCTAGAGAATATTATCCTTTC 

GGAGAAGATTTTGTAGCTTTTAATATTTACAAGCGTG 

CAATGAAAACAAGGGTTGCTAAAAAGATTGTCAATTT^ 

TATATTAAATATGGGGATTGATAATTTAAATAATGAGGAAAAGGGTAATT 

GAAGATGTTGAAGATTACGATGAATCTT^ 

TAGATTCTAGAGACTATTTTAATGTTGTTACAAAAATTAAT^ 

TTTAGGAGATTTAATCCAGGATGTTAAAAATTTTGTT^ 

AAGAATAATTTTTTTATTCAAAGCTGGGATCAAAAGG^ 

CCACTATGATTAGGCTTGGGGGGAGAAGAAAAAAC 

CGATATATOTTATCTTGAGTCAAGGGGCTGGGACC^ 

CCTAAAGATCCAGAAATTAATAATGGCTGGACTTGGATAGGCCTGT^ 

f541.aa 

MNKILLLILI^SIVFLSCSGKGSLGSEIPKVSLIIIXSTFDDKSFNESAIiNGVKl^ 
LEGLKDAGSDLIWLIGYRFSDVAKVAALQNPDMCT 
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KIGFLGGIEGEIVDAFRyGYEAGAKYANKDIKI STQY IGSFADLEAGRSVATRMYSDEIDI IHHAAGLGGIGAIEV 

AKELGSGHYIIGVDEIXJAYIJ^DNVITSTTKDVGRAIiNIFTSNHLK 

ELEKEIDNLSSKI INKEI IVPSNKESYEKFLKEFI 

t541.aa 

CSGKGSLGSEIPKVSLIIIX3TFDDKSFTIESALNGVKKVKEEFKIELVLKESSSNSYL 

RFSDVAKVAAI^NPDMKTAIIDPIYSNDPIPANLVC^^ 

YGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDII 

AYLAPDNVITSTTKDVGRALNIFTSNH^ 

I IVPSNKES YEKFLKE 

FI 

f541.nt 

ATGAATAAAATATTGTTGTTGATTTTGCTTGAGAG 

GCGAAATTCCTAAGGTATCTTTAATAATTGATGG^ 

CGTAAAAAAAGTTAAAGAAGAATTTAAAATTGAGCTTGTTTO 

CTTGAAGGGCTTAAGGATGCGGGCTCAGATTT^ 

CGGCTCTTCAAAATCCCGATATGAAATATGCAAT^ 

GGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATT 

AAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGT^ 

ATGCTAATAAAGATATAAAGATATOTACTCAGTATATTGGTAGTTTT^ 

AACTAGGATGTATTCTGATGAGATAGACATTATT^ 

GCAAAAGAACTTGGTTCTGGGCATTACATTATT^ 

TAACATCTACAACTAAAGATGTTGGTAGAGCTTTAAATATTTTO 

AGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGG^ 

GAACTTGAAAAAGAAATTGACAATCTTTCTAGCAA 

GTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

t541.nt 

TGTAGTGGTAAAGGTAGTCTTGGGAGCGAAATTCCT 

CTTTTAATGAGAGTGCTTTAAATGGCGTA^ 

CTGATCAAATTCTTATTTATCTGATCTTO 

AGATTTAGCGATGTGGCCAAGGTTGCXSGCTCTTC^ 

CTAACGATCCTATTCCTGCAAATTTGGT^ 

TGCTGCAAAACTTTCTAAAACAGGTAAAATTGGAT^ 

TATGGGTATGAAGCTGGTGCTAAGTATGCTAATAAAGAT^ 

ACCTTGAAGCTGGTAGAAGCGTTGCAACTAGGATGT^ 

TGGAGGAATTGGGGCTATTGAGGTTGCAAAAGAACTTOT 

GCATATCTTGCTCCK^CAATGTAATAACATCTACAACTAA 

ACCATTTAAAAACTAATACTTTCGAAGGTGGCAAATTAATAAATTATGGCCTTAA 

AAGAAATCCTAAAATGATTTCCTTTGAACTTGAAAAAGAAATTC 

ATTATTGTTCCATCTAATAAAGAAAGTTATGAGAAGTT^ 

f561.aa 

MYKNGFFKNYLSLFLIFLVIACTSKDSSNEYVEEQ^ 

KVFYFSEEDGHFQTI PSKENAKLI VYFYDNVYAGEAPI S I SGKEAF I FVGITPDFKKI INSNLHGAKSDLIGTFKD 
LNIKNSKLEITVDENNSDAKTFLESVNYI IDGVEKI SPMLTN 



t561.aa 
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CTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGH 

KLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDI^IK^ 
FLESVNYIIDGVEKISPMLTN 

f561.nt 

ATGTATAAAAATGGTTTTTTTAAAAACTATTTC 
ATAGCTCAAATGAATATGTTGAGGAGK^AAGAAGCGGAGAACTCTTC 
TACTATTGGGCACGTTTTTCACGCTATGGGAGTAG 
AAGGTTTTTTATTTTTCTGAAGAAGAT^ 

ATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATC 
TACCCCTGACTTTAAAAAGATTATAAATAGC^ 
CTOAATATTAAAAATTCAAAATTGGAAATTACAGT 
TTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAA 

t561.nt 

TGTACTTCAAAAGATAGCTCAAATGAATATGTTGAGGAGCAAG 

AAATAGATGAACATACTATTGGGCACGTTTT^ 

GGGGAAAAATATAAAGGTTTTTTATTTTTCTGAAGAAGATG 

AAGTTAATAGTTTATTTTTATGACAATGTTT^ 

TTTTTGTTGGGATTACCCCTGACTTTAAAAAGATTATAAATAG 

TACTTTTAAAGATCTTAATATTAAAAATO 

TTCCTTGu^ATCTGTTAATTACATTATCGACGGCGTTGA 

£604. aa 

MSFNKTKKIGKKIKrVTLLMLAVSL^ 
KTTCKIJCPGLAKNWEASKDKiCTYQFY^ 
KVSDSELGIKAIDSKTLEITLTAPKPYFLELLLHYAFM^ 
FEKNERYYNAKEVELDELVYI TSDOTLTVYNMYKNNEIDAIFNS IPPDI^ 
KPLDDARTOEALTLAIDRETLTYKVIjNTC 

LKYNTNETniKKIAAFIQNQWKKILNINLMLTNENWP IRVGRIGEYLDPHTYFTIFTRENSQLA 
SYGYSNLEFDKLIRESDLEKDPIKRKQLLRKAESI I 1 EKDFPAAPI YI YSGHYLFRNDKWTGWNPNVSEVYYLSEL 
KPIKNAKHN 

t604.aa 

CNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLOT 

DNLFWSIXSVEITAEGII^SFLRIIjNKETGSTNVDMLKSIIKN 

FLELIJiHYAFMPVPIHVIEKYKGNWTSPENMVTSGPFKIi^ 

TVYNMYICNNEIDAIFNSIPPDIVNEIKLQK^ 

TCTVPTREITPDLKNYNYGKKLALFDPEKSKKIJJU^ 

LMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFT 

LLRKAESII IEKDFPAAPI YIYSGHYLFRNDKWTGWNPNVSEVYYLSE 

f604.nt 

ATGAGCTTTAATAAAACTAAAAAAATCGGTAAAAAAATTAAAATAGTAAC^ 

TTGCATGCAATAATAATTCAGAAAAAGAAAAATTAGCAT^ 

CCCTCATTTGGTAGATGAGACAATAGGAGCAAGAATTTO 

AAAACAGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGA^ 

TAAGGGACAACCTTTTTTGGAGCGATGGAGTTC 

AAATAAAGAAACAGGATCTACAAATGTTGACATGCK^^ 

AAAGTATCXGATTCTGAACTTGGAATCAAGGCAATTGATAOT 

CATATTTTCTTGAACTGCTTOTACATTACGCAT^ 
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TTGGACAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTA 

TTTGAAAAAAACGAACGTTATTATAATGCAAAAGAAGT^ 

ATCTTACTKSTGTACAATATGTACAAAAACAACGAAATTGATGCTAT^^ 

TGAAATAAAACTACAAAAAGACTATTACCAACACA^ 

AAACCCCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCIT 

TAAATGATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAA 

ATTTGATCCTGAAAAATCTAAAAAGCTTTTGGCA^ 

CTAAAATATAATACAAACGAAACTCATAAAAAAATT^ 

TCAATCTTATGCTTACCAACGAAAAOTCGCCTGTTCTTAC^ 

AGTTGGACGCATTGGGGAATATTTAGATCCACACACATACTT^ 

TC^TACGGATATTCAAACCTAGAATTTGACAAAC 

AACAATTACTCAGAAAAGCAGAATCAATAATAA 

GCATTATCTTTTTAGAAACGATAAATGGACTGGATGGAATC 

AAACCAATTAAAAATGCAAAACATAATTAA 

t604.nt 

TGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTC 

ATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAA^ 

AGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGAT 

GACAA(XOTTTTTGGAGCGATGGAGTTGAAATT 

AAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAA 

ATCCGATTCTGAACTTGGAATCAAGGCAATTGAT^ 

TTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCT^ 

CAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAA 

AAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTG 

ACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTOT 

TAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATT^ 

CCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGA 

GATGGCAGAGTTCCTACAAGAGAAATAACTCCTGATCTTAA 

ATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATC 

ATATAATACAAACX3AAACTCATAAAAAAATTGC 

CTTATGCTTACCAACGAAAATTGGCCTGTTCTO 

GACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTC^ 

CGGATATTCAAACCTAGAATTTGACAAACTC^TCAGA 

TTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTC 

ATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATC 

AATTAAAAATGCAAAACATAATTAA 

f736.aa 

MKKVT ILIFMLSTSLLYNCKNQDNEKIVS IGGSTTVSPILDEMI LRYNKIIOJNTKVTYDAQGSSVGINGLFNKI YK 

IAISSRDLTKEEIEQGAKETVPAYDALIFITSPEIKITNITEENLAKILNGEIQ^ 

SYSSIKDLLLNKIFKTHEEAQFRQDGIW 

INSNKYTIKRNLI I VTNNKYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 
t736.aa 

CKNQDNEKIVSIGGSTWSPILDmiLRYNM 
ETVFAYDALIFITSPEIKITNITEENLAKILNGEIQN^ 

EAQFRQDGIWKSNGEVI EKTSLTPHS IGYIGIK?yAKNSIEKGIiNILSVNSTYPTKETINSNKYTIKRNI*I IVTNN 
KYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 



f736.nt 
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ATGAAAAAAGTTATTATCTTAATTTTTATGCTATCAACAAGTT^ 

AAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAC^ 

CAATAATACTAAAGTAAC^TACGATGCACAAGGAAGTA^ 

ATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTG 

CTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATO 

TCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAAC^^ 

GAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAA 

TGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAA^ 

ATAAATAGCAATAAATACACCATTAAAAGAAATTT^ 

CTCAATTTATTGATTTCATGACAAGCTCAACTGGAGA^ 

ATAA 

t736.nt 

TGTAAAAATGAAGACAATGAAAAAATTGTATCAATTGGAG 

TTTTAAGATATAATAAAATAAACAATAATACTAAAGT^ 

GCTATTTAACAAAATATATAAAATAGCAATATCATG 

GAAACTGTATTTGCTTATGATGCTTTAATTTTCATO 

ATCTAGCTAAAATACTAAATGGAGAAATTGAAAAT^ 

GAAGCTCAATTTAGACAAGACGGAATAGTGGTAAAATCTAATGGAGAGGTAA 

ACTCAATAGGATATATAGGTCTTGGATACXX1AAAAAA 

CAGATATCCTACAAAAGAAACAATAAATAGCAATAAATA^ 

AAATACGAGGATAAAAGCGTAACTCAATTTATTGATTTCATC 

AAGGCTTTTTAGGGATAAAAACATAA 

f752.aa 

MNKKIJ^EVLLKLDQDLIKCVKGSLDLEISGVTYSSKLVLPRFV^ 
FYSPNVTYIKVDDFNIRKFMSNFSNIFYDEPSKKLKVT 

GSLIKNPYRQSTPESTEIHSFLSTMVKNEAQYAILESTSHGLDLETARLIDVN^ 

LNVKIiGLFRSVSDDAGFGVINLDDLYSSDFKNAVKKSFTYSL^ 

SLI^SFNVENVMAALILVSQII^IDIQDIVDKI^CIKSI^ 

TNRIilSVFGSAGERDVEKRFLQGQIADI YSDLI ILCDEDPRGENSMCI IKDIAKGIVNKVENKDIiFFIADRKQAI E 
KAISLAKAGDLVVALGKGHESSIIYKNREVFWNEQEW 

t752.aa 

CVKGSIJDLEISGVTYSSKLVLPRFWFALPGIHFIX5HDFIEIAIQKGSNWV 
FMSNFSNIFYDEPSKKIJtVIGKmSTTC 

HSFLSTMVKNEAQYAILESTSHGLDLETARLIDWNYFAWFT^ 
VimiDDLYSSDFKNAVKKSFTYSI^SSKADFFVSFIDEKTDSTRF^ 

SQILNIDI QDI VDKLNC I KSLDGRMDS INLGQNFSVI IDYAHTPGAFSKLFPI FKRFATNRLISVFGSAGERDVEK 
RFLQGQI ADI YSDLI ILCDEDPRGENSMCI IKDIAKGIVNKVENKDLFFIADRKQAIEKAI SLAKAGDLWALGKG 
HESSI IYKNREVFWNEQEWKNAILSLEKSEKEK 

f752.nt 

ATGAATAAAAAACTTAATGAAGTTTTATTAAAGTT^ 
TAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCC 
TTTTGATGGGC^TGATTTTATTGAAATT^ 
TTTTACAGTCOTAATGTTACTTATATTAAGGTAGATGACTOT 

TTATATATATCITCTTTTTAAAAAAAAGGGTC 
GGAAGCTTGATTAAAAATCCTTACAGACAATCAAC^ 
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TTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTC 

TGTTAATTATTTTGCAGTTGTTTTT 

TTGAATGTCAAGOTGGGTCTTTTTCGGTCTGTTAG 

ATTCTTCTGATTTTAAGAATGCTGTTAAGAAATC 

TAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATT^ 

AGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTT 

TTCAAGATATTGTTGATAAAOTTAACTGCATTAAAAGTCT^ 

TTTTTCTGTAATAATTGATTATGCTC^^ 

ACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGA 

CAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATC^ 

CATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATT^ 

AAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTG 

AAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTT^ 

GAAGTGA 

t752.nt 

TGTGTAAAAGGTTCTCTTGATTTAGAAATATCAGGAGTTACTTATAGTTC 
TTTTTGCTCTTCCAGGAATTCATTTTG 

TGTGTGTTCACGAGATGTGGATTTTTACAGTCCTAATGTTACTTAT^ 

TTTATGTCTAATTTTTCAAATATTTTTTATGAT^ 

A03GGAAAAGTTCTGTTTGTTATTATATATATCTTCTTTOT 

AGTATTTTTTGATGATGGGAGTGGAAGCTTG 

GATTCATTTTTAAGCACCATGGTTAAAAATGA 

TTGAAACAGCAAGGCTTATTGATGTTAATTATTTTGCAGTTC 

TCATGGCACAATTCAAAATTATTTGAATGTCAAGCTGGGTC 

GTTATTAATCTTGATGACCTTTATTCTTCTGATTTTAA 

GCAGTAAAGCGGATTTTTTTGTTAGTTTTATTGATGAGAAAA^ 

GGTTAAATATCTTGCTAATGTTAGCCTACTGGGGAGTTTTAATGTTG^ 

TCTCAAATTTTAAATATCGATATTCAAGATATTGTTGATAAACTTAA 

ATAGTATTAATTTGGGGCAAAATTTTTC 

TCCTATTTTTAAAAGATTTGCTACCAATAGATTGATTTCTC 

AGATTTTTGCAAGGGCAAATCGCAGATATTTATTCTGATT^ 

ATAGTATGTGTATAATTAAAGAGATTGCAAAAGGAAT^ 

TGATAGAAAGCAGGCTATTGAAAAAGCAATAAGTCTTGCAA 

CATGAAAGTTCAATAATTTATAAAAATAGAGAAGTTTTTTGG^ 

GTTTAGAAAAATCAGAAAAGGAGAAGTGA 

f798.aa 

MVFRTYKHLELIMLPMLMLSCAFFKKPQSVHQDSNTGKPISDEKLH 
LGEIX3KEIPEFKNKFGYSYIISPVKMIX3KYSYYASLLILFETTKI^GD 
QEEGYVTAYPFGILMSDEIKNAFKLTYKNGHWNYMLADLTVKNKL 
DIAGDLFEDI 

t798.aa 

CAFFKKPQSVHQDSNTGKP I SDEKLHLI SGKI SNKKLPI INSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYI 
ISPVKMIX3KYSYYASLLILFETTKNGDDEYEIEDVKFVTAGSTLELKNSLLAVENSQ 
NAFKLTYKNGHWNYMIJ^LTVKNKLTQETKI YKI SLNSKLI I EFLKEVLKENS ILKDIAGDLFEDI 

f798.nt 

ATGGTATTTAGAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTA^ 

AACCACAATCTGTAGATCAAGACAGCAA^^ 

AATTTCAAATAAAAAATTGCCAATCATAAATAGTA^ 
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TTAGGCX3AAGATGGAAAAGAAATACCAGAATTTAAAAACAAA 

TGGATGGAAAATATAGTTATTACGC6TCATTATTAATACTTTTTGAAACAACT 

AATTGAAGATGTTAAATTTGTAACAGCTGGTTC^ 

CAAGAAGAAGGATATGTTACTGCATACCCATTTGG^ 

CATATAAAAATGGTGATTGGAATTATATGCTTG 

TTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTT^ 

GACATAGCTGGAGATTTATTTGAAGATATATAA 

t798.nt 

TGCGCTTTTTTTAAGAAACCACAATCTGTAC^ 

ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAAT 

AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATT^ 

ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTC^ 

GAGATGATGAATATGAAATTGAAGATGTTAAATTTG 

AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCC^ 

AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTO 

CTCAAGAAACTAAAATTTATAAAATTTCTCTTAATTCAAAA 

AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATATAA 

£805. aa 

MLJlKLKDISKIVLVTIX5LTPNCQTCGKLIANGDEVyiAEIX^ 
QASSYNPTRILNIDKKGLICHGYDANLNVLDKDFNLKLTMIESKIIFNNL 

t805.aa 

CQTCGKLIANGDEVYIAEIXSLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAVQASSYNF 
GYDANLNVLDKDFNLKLTMI ESKII FNNL 

f805.nt 

ATGCTTAGAAAGCTTAAAGATATAAGTAAAATAGTCCTTGTAACTGACGGACTTA 
GAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGACT 
AGCTK^ATCAACACTCACAATGATACAAGGTC 
CAAGCAAGCTCTTAC^TCCAACAAGAATTCTCAA 

ACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATC 
A 

t805.nt 

TGTCAAACTTGTGGAAAACTAATTGCAAACGGAGACGAAGTTTATATT^ 

AAAGCAACACAATAGCTGGATCAACACTCACAATGATACAAG^ 

AAGCGATGCTGTTCAAGCAAGCTCTTACAATCC^ 

GGATATGATGCAAACCTCAATGTCCTAGATAAAGATTTTAATC 

TTAACAATCTCTAA 

f635.aa 

MKILWLIILVNLFLSCGNESKEKSNLGLRLRELEI^ 

LEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNL^ 

AISNMYVYDLDKQLEAKNYLNKLGDMGEDYFEFIiMIjRGA 

LK 



t635.aa 
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CGNESKEKSNLGLRLI^LEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFN 

PAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEV^ 

AKNYLNKI^DMGEDYFEFIJIIjRGAN^ 

f635.nt 

ATGAAAATTTTGTGGTTAATAATTCTTGTTAATOT 
TTGGTCTTAGATTAAGAGAATTGGAAATTTO 
TGAAAAAGAAGATAAGAATATTTTAAAGATAGTTAATTC 
CTTGAATTTTTTAAGCTTGGTCAGT^ 

AAAATACATAATTCTTGCTGAAAATTCTTTTT^ 

GCGATTTCTAATATGTACGTATATGATCTTGAT 

TGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCA^ 

ATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCAAAAAGAAGG 

TTGAAGTAA 

t635.nt 

TGTGGCAATGAATCTAAAGAAAAATCAAATCTTGGTC 

AATCTAAGATTGAAGTTTATAAAGAATTTATTC 

TAAGAAAGCCAGATTTTTTAATTTAATTGGTC^ 

TTTGCTAAAAATTTAGAAATCAATCCCAATAATTATTTATCTC 

OTAAAAATTTAAGAGTAAAAGATGAAGTTGAAAAATACATAATTCTTG^ 

AATTAGAGATGATTTTAAAGATTCTCTTTT^ 

GCTAAAAATTATTTAAATAAACTTGGTGATATGGGTGAGGACTAT^ 

ATTATTCGCTGGGCGATCTTGGTAATGCTATATTGTTTTATGAT 

AAAAGAAGGTGTTTCTAGGATCATGAGTAATTTGAAGTAA 

£314.aa 

MNNCLIKFFIFLLWSNSYVAFSKNVNVLI^ 

CGVGKVNAGVWTSYILSKYNI SHVINSGVAGGWSAKYKDIKVGDWVS SEVAYHDVDLTKFGYKVGQLTGGLPQK 
FNANKNLIKNAIEAIKSKVGGSNAYSGLIVSGDQFIDPTYINKI IGNFKDVXAVEMEGAAIGHVSHMFNI PFIVTR 
SISDIVNKEGNEVEYSKFSKIAAFNSAKWQEILRKLZ 

t314.aa 

KNVNVLIWAMDSEFDQINKLMSNKEEIVLKEYGL^^ ICGVGKVNAGVWTSYI LSKYNI SH 

VINSGVAGGWS AKYKDI KVGDVVVSSEVAYHDTO I KNAI EAI KSKVGGSN 

AYSGLIVSGIXJFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIRSIS 
FNSAKWQEILRKLZ 

f314.nt 

ATGAATAATTGTTTAATAAAGTTTTTTATTTTTOT 

TCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTC 

AGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTO 

TGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACA 

ATTCTGGCGTTGCTG^TGGCGTTGTTAGTGCTA 

GGTTGO^TATCATGATGTTGATTTGACTAAAT^ 

TTTAATGCCAATAAAAATTTAATTAAGAATGCCAT^ 

CAGGATTAATAGTTTCAGGAGATCAGTTTATTGATCCAACTTATAT^ 

AATAGCTGTTGAGATGGAAGGTGCAGCAATAGGGCATC 

TCAATATCTGACATTGTAAATAAAGAAGGGAATGAGGTO 

CAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 
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t314.nt 

AAAAATGTCAATGTTTTAATAGTAACT 

AAGAAATAGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGA 

TATTATTTGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACT 

GTCATTAATTCTGGCX3TTGCTGGTGGCGTTGTTAGTGCTA 

CTTCAGAGGTTGCATATCATGATGTTGATTTGACTAAATO 

TCAAAAATTTAATGCGAATAAAAATTTAATTAAG^ 

GCATATTCAGGATTAATAGTTTCAGGAGATCAGTTTATTG^ 

AAGATGTAATAGCTGTTGAGATGGAAGGTGCAGCAATAGO 

TATTAGGTCAATATCTGACATTGTAAATAAAGAAGGGAATGAGGTTGAATATACT 
TTCAATTCAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 

f32.aa 

MNTKTLYLISLILLACmNNKIPLIQKIJDLPKSSILGFSNKMGIIIKDYAFLSKST 
KIEKTLEKTERYGIEGNWILVNYKGTKRYIFSKDINIVNNLI IDHSKZ 

t32.aa 

CNKNNKI PLI QKLDLPKSSI LGFSNKMGI I IKDYAFLSKSTKKNSELDYDYAI LLRKDEWKI EKTLEKTERYGIE 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSKZ 

f 32. lit 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGC 

AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAA 

TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGACT 

AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATT^ 

CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTTAAT 

t32.nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGAT^ 

AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTA 

CGCAATTCTACra^GAAAAGACGAAGTCGTAAAAAT^ 

GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATC 

TAATAATTGATCATTCTAAATAG 

£320. aa 

MKSIYALLFLFINLSLLANNISKKDLEV^ 

PENYKI PDLVNIDDFEDLKNLGAKTIKVRKI LI EDLI RLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYNVKTYGRK 

VAETQSAIPGHSQHHMGTAIDFINIDDNLLNTKEGK^^ 

FIQKKYFNNLQHKLLEFWNQNKTNLINLIEKYANZ 

t320.aa 

NNISKKDLEVLIiKI AQAMNKECKNFIEKNPIQFLKEI KPLVDAEKNNLLTLINKKI PI PENYKI PDLVNIDDFEDL 

KNLGAKTIKVRKILIEDLIRLIKDAKKFGIEIKIKSAYR^ 

AIDFINIDDNLIJ^KEGKWLYENSLKYG 

NQNKTNLINL I EKYANZ 



f320.nt 
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ATGAAATCAATTTATGCTTTATTATTTCTATTTATO 

TAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAAT^ 

CTTAAAAGAAATAAAACCCTTAGTAGATGCAGAAAAAAATAACCTCTTAACTCTAAT 

CCTGAAAATTATAAAATACCTGATCTGGTAAATATTC^ 

TTAAAGTAAGAAAAATATTAATCGAAGATTTAAr^ 

AATCAAATCTGCTTACAGAACGCAAGAATATC^ 

GTTGCAGAAACCCAATCAGCAATTCGAGGCCATTCTCAACATC 

ATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCT 

CCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTTATACAT^ 

TTTATTCAGAAAAAATATTTTAATAATTO^ 

TTAACCTAATTGAAAAATATGCAAACTAA 

t320.nt 

AACAACATTTCAAAAAAAGATTTAGAAGTACTGCTAAAGATTGCCGAAGCA 

TTGAAAAAAATCCTATTCAGTTCTTAAAAGAAATAAAACCCTTAGTAGATC 

AATAAATAAAAAAATACCAATTCCTGAAAATTATAAAATACCTGATCTGGTAAATA 

AAAAATCTTGGAGCAAAGACTATTAAAGTAAGAAAAATATTAATCGAAGATTTAA 

AAAAATTTGGGATTGAAATTAAAATCAAATCTGCTTAC^^ 

TGTCAAAACTTATGGCAGAAAAGTTGCAGAAACCCAATCA 

GCAATAGATTTTATAAATATAGATGATAATTTACTAAACACAAAAGAAGGAAA 

AATACGGATTTTCCGTTTCATACCCAAAAGGATATGAAACGGAC^ 

ATACATAGGACCTAAGCCATGCTTTATTCA 

AACCAGAACAAAACAAATCTTATTAACCTAATTGAAAAAT^ 

f342.aa 

ML YLGDNKAMRTK 1 1 IMTI 1 1 LLAPI SGFSNSKESARGKFGAGI I LPLPI ALQINIGNFDLDIGLYSGVNNLFSDW 

KTLFIALDYIFYIYTFPGAANILDFSVGAGGYGTIWFSRFGG 

PGLGMNVWSNGVGFRWEVFAGLGLRFWFTZ 

t342.aa 

LAP I SGFSNSKESARGKFGAGI ILPLPIALQINIGNFDLDIGLYSGVNNLFSDW^ PGAANI 

LDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLALNIA 

GLRFWFTZ 

f342.nt 

ATGCTATACTTAGGAGATAATAAAGCAATGAGAACAAAAATAATTATO 

TCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGT 

AAAACATTATTTATAGCATTAGACTATATTTTC 

CAGTTGGCGCAGGGGGATATGGAACAATATGGTT^^ 

CATTGGAGCAAGATTGCCTTTGGCCTTAAA 

CCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGAT^ 

TCTGGTTTACTTAA 

t342.nt 

TTAGCCCCAATCTCAGGATTTTCTAATTCAAAAGAATC^ 

TACCAATTGCTCTACAGATTAATATAGGAAACTT^ 

TTGAGACTGGAAAACATTATTTATAGCATT^ 

TTGGATTTTTCAGTTGGCGCAGGGGGATATGGAACAATATC 

GACCAATGAGCATTGGAGCAAGATTGCCTTTGGCC 



WO 98/59071 



PCT/US98/12718 



159 

TABLE 1. Nucleotide and Amino Acid Sequences 

ACGAATAGCACCCGGACTTGGAATGAATGTTTGGA^^ 
GGACTAAGATTCTGGTTTACTTAA 

f352.aa 

MNKTKNRSLTYFIILSCISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSI^ 
KLPENIRDKKLPQKRMDENDLKSVIENYENKIKNIEKLLKTKNQKTSENENI^ 
EIKKLLNKKI KPKEDENYEKINI ENI EEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ . 

t352.aa 

CISLFGANNNTISYSSIEIPLEDLSEEFTCSSGNKSIX^INTSKHLNKNIVSYEDPKKGKDLKLPENI^ 

DENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTN^ 

NYEKINI ENI EEETDDDFEDNYEYNDEIEEQMRT ITLLMKEZ 

f352.nt 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTAT 

ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTA 

CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTA 

AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAA 

AAAATTATGAAAATAAAATTAAAAACATAGAAAAGC^ 

TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTO 

GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAAAATO 

TTGAAGAAGAAACTGATGATGATTTTGAAGAC 

TACCCTTCTAATGAAGGAATAA 

t352.nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTC 
AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTC 
TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAAT 
GACGAAAATG^TCTAAAATCTGTAATTGAAAATTA 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAA 
AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATC 
AATTACGAAAAAATAAATATTGAAAACATTGAAGAAG 
ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAA 

f301.aa 

MQIIXSKIYSIISFPVRDSVSTLGVIGILICFDESLD^ 

FSTAYSEOTLSKVIAYAKKDSSSSQYTFNYERDFYSIiNFVKTDDFLTQGLILN^ 
AIIFYLCNTFVFSLIlTOFTraiVDYQKSKSDPFSL^ 

EYLEQIETAISNTESIDSSILVYEQLROTFSRFEKSIVDILKGFESIADPINDHNKYISEISSN^ 
KNIJ3IFNKVATINSTDIENIKSKVFDLNIW 

GDAGKSFAVVAEEIR1GLAINSGKYSKTIKDELKTVDSIIAVINSEIOT 

HFKEIGEFKERYLSHDTKIRDAKNMYKEIFNNHYFISGKFNNFSQDLKEFKVSK^ 

ILKTKELIQKINDEIKDILFZ 

t301.aa 

CFDESLDIIENQLYSSLKFGSKNYNFFMLJDRNYMPIFS^^ 

YERDFYSLNFVKTDDFLTQGLILNW^ 

DPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKI 

FSRFEKSrVDIIJCGFESIADPINDHNKYISEISSNFEESVSFFYSIDKNLEIFNKVATI^ 

VFENVNKNFADLLSQTNSLQSVNKLLVS I SAQTNMLAMNAAIEAAKAGDAGKSFAWAEEIRKLAINSGKYSKTIK 
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DELKTVDSIIAVINSEIDTIYKNFIDIQD^ 

FNNHYFI SGKFNNFSQDLKEFKVSKMNLDAVSSLQEYS SLVKS SKDKI liKTKELI QKINDEIKDI LFZ 
f301.nt 

ATGCAAATAGATGGGAAAATTTATTCTATAATAAGTTTTCCAGTTAGAGATTC 

GGATTTTAATATGCTTTGATGAGTCGTTAGATATTATTGAAAATC 

AAATTATAATTTTTTTATGCTTGACAGAAATTACATGCC 

TTITCTACAGCTTATAGTGAGAATTTTTTGAGTATWVGT^ 

ACACTTTTAATTATGAAAGAGATTTTTATTCTTO 

TTTAAATGTCAATTCCATTCCTATTATGTTTAAATCAAATTGGG 

GCAATTATTTTTTATTTATGCAATACTT^ 

AATCAAAAAGCGATCCTTTTAGTCTTGAATCTCCCTO 

TTGAAAGCTAGATAATCTGTCTTCTAAGAGTAATGAA 

GAATATTTGGAAGAAATAGAAACTGCTATATCAAATACTGAGAGTATAGA 

TAAGAGATACTTTTTCTAGATTTGAAAAATCAATTC 

TAATGATCACAATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGA 

AAAAATTTAGAAATTTTTAATAAGGTTGCTACTATAAATTCTACTGATA 

ATTTAAATATTGTTTTTGAAAATGTGAATAAAAATTTTGCA 

AAATAAACTTTTAGTTTCAATTTCAGCTCAGACCAATATGCTTGCT^ 

GGTGATGCAGGTAAAAGTTTTGCAGTTGTTGCTGAGGAGAT^ 

AAACC^TTAAAGATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTC^ 

AAATTTCATAGACATTCAAGATAATGTGGACAACAATTT^ 

CATTTTAAAGAAATTGGCGAGTTTAAAGAAAGGTATTTGTC 

ATAAAGAAATATTTAATAATCATTATTTTATTAGTGGCAAGTTTAA^ 

AGTTTCTAAGATGAATTTAGATGCGGTAAGTTCTCTTCAAGAATAT^ 

ATATTAAAGACAAAGGAATTGATTCAAAAGATTAATC^ 

t301.nt 

TGCTTTGATGAGTCGTTAGATATTATTGAAAATC 

TTTTTATGCTTGACAGAAATTAGATGCCCATTTTTT^ 

TTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAA 

TATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGAT^ TTATTTTAAATGTCA 

ATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTC 

TTATTTATGCAATACTTTTGTTTTTTCATTAA 

GATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATT^ 

ATAATCTGTCTTCTAAGAGTAATGAATCTTT^ 

ACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATT^ 

TTTTCTAGATTTGAAAAATCAATTGTTGA 

ATAAATATATATCAGAAATCTCTTCAAATTTT^ 

AATTTTTAATAAGGTTGCTACTATAAATTC^^ 

GTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTCT 

TAGTTTCAATTTC^GCTCAGACCAATATGCTT^ 

TAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCT^ 

GATGAACTTAAAA(^TCGACAGCATTATTGCAGTAATTAATTCA 

ACATTCAAGATAATGTCGACAACAATTTTTC^^ 

AATTGGCGAGTTTAAAGAAAGGTATTTGTCTCACGATACTA^ 

TTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTO 

TGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATC 

AAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCT^ 



f346.aa 
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MSIDKVPDEAFAEKIVGTCIAILPTSNEL^ 
AEEGINVKQGEVIIRLDLEYLKEHSESV^ 
P VI AGKDLVLRVKK Z 

t346.aa 

CIX3KIGKIFKTNHAFSLETKEGVEIFVHFGIOTLNI^GKGFTRVAEEGI 
VIANSDEVS S IEYSFGRLENDSEYI LSSSTVLTEEIRHKI SQTKPVIAGKDLVLRVKKZ 

f346.nt 

ATGTCAATTGATAAGGTTCCCGATGAAGCTTT^ 

GCAATGAGTTGTTGGCGCCTTGTGATGGGAAAATAGGTAAAATT^ 

TAAAGAGGGCGTTGAAATTTTTGTCCATTTTGGAA 

GCTGAAGAGGGCATTAATGTTAAACAAGGTGAAGTTATTATTAGGCTT^ 
AATCCGTTATTACTCCGGTTGTTATTGCAAATTCTGATGAAGTTTC 
AAATGATTCTGAATATATTTTATCATCTTCAACTGTCTTGACAG 
CCTGTTATAGCGGGCAAAGATTTGGTGTTGCGAGTTAAAAAGTAA 

t346.nt 

TGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTT^ 

TTGTCCATTTTGGAATTAATACTCTTAATTTAAATGG 

TAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAA^ 

GTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTC 

TATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATC 

TTTGGTGTTGCGAGTTAAAAAGTAA 

f373.aa 

MNYQRIKNYCKFTSVFLFFLFSCVSNELKLDQSLVKGKIiVNGLRYYIYKNQTP 
IAHYLEHMAFNGTKDYPGNSIVDVLKKFGMQFGADINAATSFDFTYYRI^ 
MKEEIDLERNIIIEEKKLGETYPGRIYEKMDKFLTSGSLYEFRSPIGLEEQ 
VVGDIDPIEIEEKIKKQFVSWKNPTDKIKEVKVSLDVELKDKFLLLEDLETO 

IKKSLLAALFENRFSELKTAGVKQFKNVSNKDFFSFKSDNNTI VAKS I SLNFNPDHLNEGIQDFFYELERIRKFGF 

TQGELEKVRSQFYKSLELRKKNINKTNSWAIFQDLIEIAINGSNKFDM 

VKNCAIFYSYHGRAHFVLTLEDIDNLQKIALKRELKPYEN^ 

EVYFKYNDQKKGVIDFSATSWGGLINEDLKLI PVLSFAPGWSGSGYGDYSALQI EKYLSDKAVSLRVGVGAQES Y 
I SGSSDKKDLETLFQLI YFTFKEPKIDDVSLQNAINNIKALI KSNENSSDYHFHKAI SKFLNNNDPRFEDTKDSDL 
QYFTKENI LS FYKKRFTYANNFKFVLLETQ I FRQZ 

t373.aa 

CVSNEI^LDQSLVKGKIjVNGLRYY I YKNQT PKNAVNMG I VFNVGSLNEEDNERGI AHYLEHMAFNGTKDYPGNS IV 
DVLKKFGMQFGADINAATSFDFTYYRLDLSDGNNKDEIDES INILRNWASQISFMKEEIDLERNI I IEEKKLGETY 
PGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVIW 
NPTDKIKEVKVSLDVTSLKIDKFLLLEDLEVGEPSL^ 

KQFKWSNKDFFSFKSDNNTIVAKSISLNFNPDHLNEGIQDFFYELERIRKFGFTQGELE 
INKTNSV^IFQDLIEIAINGSNKFDMNEYC^ 

IDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVI^^ 

GLINEDLKLI PVLSFAPGWSGSGYGDYSALQI EKYLSDKAVSLRVGVGAQESYI SGSSDKKDLETLFQLI YFTFK 
EPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAI SKFLNNNDPRFEDTKDSDLQYFTKENI LSFYKKRFTYANNF 
KFVLLETQI FRQZ 



f373.nt 
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ATGAATTATCAAAGAATTAAGA 

ATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATC 

AACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGC 

ATAGCGCATTATCTTGAACATATGGCTTTTAATGGT^ 

AAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTO 

GTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGA 

ATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCT^ 

TTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTC 

AATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATA 

ATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATT^ 

TGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTC 

ATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTT^ 

AAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAAT^ 

CTTTAATCCAGATCATTTGAACGAAGGAATACAAGACT^ 

ACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATO 

CAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTAT^ 

TTGCGATCTTTOTTTTCAATATTTGGAAAAGAT^ 

GTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATC 

TTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAA^ 

TTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTC 

GAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAACT 

ATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGA 

TGCATTACAGATTGAAAAATATTTATCAGATA 

ATTTCTGGAAGTTCAGATAAAAAAGATCTTGAAACTCTTTT^ 

TTGATGATGTTTCTTTGCAAAATGCTATTAATAA 

TCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGA 

CAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAC^ 

TGCTGGAGACTCAGATATTCAGACAATAA 

t373.nt 

TGTGTTTCTAATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAG 

ATAAAAATCAAACCCCAAAGAATGCCGTTAATATGGGAATTC 

TGAGAGGGGAATAGCGCATTATCTTGAACATATGGCTTTTAAT^ 

GATGTTCTTAAAAAATTTGGAATGCAATTTGGTGCTGACATO 

GACTTGATTTGTCAGATGGTAATAATAAAGATGAAATTGATGAATC 

AATCAGTTTCATGAAAGAAGAAATAGATOTAGAGCGAAATATTATTATT^ 

CCTGGAAGAATTTATGAGAAAATGGATAAGTTTTTGAC^ 

TTGAAGAGCAAATTTTATCTTTTCAGCCA 

AAGTGTTATTGTGGTAGGAGATATTGATCCTATAC5AAATTGAAGAGAA(^ 

AATCCAACCX5ATAAAATTAAAGAAGTAAAAGTAAGTTTAGACCT 

ATTTGGAAGTTGGAGAGCCTAGTTTAATGTTCTTTAAA 

TTTAAATGCTATTAAAAAGTCTTTATTAGC 

AAGCAATTTAAAAATGTTTCAAATAAAGATTTTTTCTCA 

TTTCTTTAAACTTTAATCCAGATCATTTGAACGAAGGAA 

ATTTGGATTTACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAAT^ 

ATAAATAAAAGAAATTCATGGGCTATTTTTCAGG^ 

TGAATGAATATTGCGATCTTTCTTTTC 

AGAGTTTGATGTAAAAAATTGTGCAATTTTTTATTC 

ATTGACAATCTTCAAAAGATAGCTTTAAAAAGAGAGTTAAA^ 

TTAAGAAGTCTTTAGATGATAAAGATArrATTAGAGAAAATGAGTTTGAAAA 

AAATGGGGTTGAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTA 

GGTTTAATTAATGAAGATTTAAAACTTATTCCTGTTTTATC^ 

GTGATTATTCTGCATTACAGATTGAAAAATATTTATCA^ 

AGAATCATATATTTCTGGAAGTTCAGATAAAAAAGATC^ 
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GAACCCAAAATTGATGATGTTTCTTTGCAAAATGCT^ 
GTTCTGATTATCATTTTCATAAAGCCATTAGTAAA 
TAGTGATTTGCAATATTTTACAAAAGAAAATATTTTGTC 
AAGTTTGTCTTGCTGGAGACTCAGATATTCAGACAATAA 

f384.aa 

MDWDFEKIIFLLNESTRI^SGCAKLILDFKSDGSIVTQ^ 

LISESTFIIDPIIXSTSSFAAGLPSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDF^ 

FIFDNSKCYNIHSLLAVSRSIIRLFNIJDISSHIHINGSCVYSFAKLFTGSYKAYFS 

GEFYCGNKMTLDI LDSMYILEPNNHKRWSLKDFF I YSDNKSTIDI I RKDANKKINK 

t384.aa 

CAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETI^ 

PSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNKFIFDNSKCYNIHSLLAVSRSII 

RLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFV^ 

NNHKRWSLKDFFI YSDNKSTIDI IRKDANKKINKZ 

f384.nt 

ATGGATTGGGATTTTGAAAAAATTATATTTTTAT 

TAATTTTAGATTTTAAATCTGATGGGTCTATTGTAA 

GATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATA 

TTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTC 

GAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATO 

TATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAAGATTGG 

TTTATTTTTGATAATTCTAAATGTTAC 

ATCTTGATATTTCTTCTCATATTCATATO 

TAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTC 

GGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGAT^ 

AAAGATGGTCCTTGAAAGATTTTTTTATTTATTC 

TAAAAAAATCAATAAGTAA 

t384.nt 

AGTGGTTGTGCTAAATTAATTTTAGATTTTAAATC 
AATTCITATTCAAAGAGATCAAAAAGCCT^ 

GTATATCAAAGATGCTTTAATATCAGAGAGTACTTTTATTATTGATC 

GGCCTTCCTTCATATGGAATATCGCTAGCGTATGCTAGTGC^ 

TAAGCGGAGAGTTTTTTATTACTTCTAAAGATAA 

AAAGGATTTTAATAAATTTATTTTTGATAATTCTAAATO 

ATTATAAGGTTATTTAATCTTGATATTTCTTCTC 

TTTTTAGAGGTTCTTATAAGGCCTACTTTTC 

TAAATTGGGCATGGTTGGCGAATTTTATTG 

GAGCCTAATAATCATAAAAGATGGTCCTTGAAAGATTTTTTTATTO 
TAAGAAAAGATGCAAATAAAAAAATCAATAAGTAA 

£860. aa 

MAFYKLNDNIALAEDLLKYLLSSILNECSQDMDFLENYIEKGLIKKLEW 
KPYWGIDLQTDHERYLTEETFKKPVVVTDYPKNFKAFYMKANK^ 

RIKEIiNLNIEHLNWYLDLRRFGSAPHSGFGLGLERLVQYSTGI SNIRDS I PFPRTPKNLYFZ 



t860.aa 
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CSQDMDFLEOTIEKGLIKKLEWINS^ 

IDYPKNFKAFYMKANKDNKTVKGMDILVPKIGE I IGGSEREDDLQKLENRIKELNLNI EHLNWYLDLRRFGSAPHS 
GFGLGLERLVQYSTGI SNIRDS I PFPRTPKNLYFZ 

f860.nt 

ATGGCTTTTTATAAGCTTAACGACAATATTGCCCTAGCAGAAGATC 

ACGAATGCTCACAAGATATGGATTTTTTAGAAAAOT^ 

AAATTCAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTC 

AAACCTTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACA 

TGGTCATTGATTATCGAAAAAATTTCAAAGCATTT^ 

GGACATACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGA 
AGAATAAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTT^ 
ATTCTGGCTTTGGACTTGGACTTGAAAGATTGGTGCA^ 
ATTCCCAAGGACTCCTAAAAATCTTTATTTTTAA 

t860.nt 

TGCTCACAAGATATGGATTTTTTAGAAAATT^ 
CAAATTTTGAGGTTATTACCTATACTAAAGCAAT 

TTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGA 

ATTGATTATCCAAAAAATTTCAAAGCATTTTAC7VTGAAA 

TACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATC 

AAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATC 

GGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACA 

CAAGGACTCCTAAAAATCTTTATTTTTAA 

f446.aa 

MKILRJjCLLFLFFACTFDYDEYSSRSDVAJGCFPSIQILGIK^ 

YHSLDNEISGKFNNLEGSYITKDLDMRDSVEFKIEDKNNYYL^ 

KGFSYFLKSNVFYFDSGVEGIMNZ 

t446 .aa 

CTFDYDEYSSRSDVAKKFPSIQILGIKYYDVVYNra^ 

LEGSYITKDLDMRDSVEFKIEDKNNYYLLNSNRIiLWKNKDKKLQSP 

DSGVEGIMNZ 

f446.nt 

ATGAAAATACTTAGAOTTTGTTTGTTGTTTTTC 

CTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAA 

AACXX5TTTTAAATTCTTTAAGCTTTA 

TATCATTCCCTAGATAATGAAATTTCAGGGAAGTTTAATAAT^ 
TGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAA 

GAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGTATTAATTAGAT^ 
AAAGGATTTTCTTATTTTTTAAAGAGCAATGOT 

t446.nt 

tgtacttttgattatgatgagtattctagtagatctgatc 

tcaagtattatgatgttgtatacaataaagagca^^ 

taaaatttataaggcagagaatggaaggtttttatatcattcccta 

ttggaaggttcttatattacaaaggatttggat^^ 

attatttccttaattcaaataggcttttatggaagaata^ 
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ATTAATTAGATTTAATGATAGC^ 
GATTCTGGAGTTGAAGGAATCATGAATTGA 

f457.aa 

MKQKLSWILLFCFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDIJCAK 

FFNNLRYEIIGRKNISKGFEFEWIKNINFQNGIEKFLAKLNKIEGRSLNI 

FDYTEVVHFFRVVKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

t457.aa 

CFLSCRSESRIiAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGL^ 

RKNISKGFEFEWIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEiKKERKK 

WKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

f457.nt 

ATGAAGCAAAAATTAAGTTGGATTTTATTATTTT^ 

TTTTAATAGAGTTTTTTGATTCTATTAAAAAT^ 

AAGTGATGATGATCTGAAGGCAAAAATTCGTGGGTTGAAATC 

TTTTTTAATAATCTAAGATATGAGATAATAGGTAG 

AAAATATTAACTTTCAAAACGGTATAGAAAAATTTT 

TAAAAATTTAGAAAAAAAAGAGCGTAAAAAAATATTTGACAATTTAATAAA 
TTTGATTAGACTGAAGTTGTTCATTTTT^ 

GAGATGTTTTAAATATACAGTCTAGAAATAAGCTTATTAATGATCT^ 
t457.nt 

TGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAA^ 

TTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAA 

GTTGAAATCTCAGGCAAAGGATGATTTCATTTTTT 

AGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGT^ 

TTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTT 

ATTTGACAATTTAATAAATGAAGTTATTGGAGAG 

GTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAGGAGATC 
TTATTAATGATCTTTTTTTGGTTTO 

f542.aa 

MRIVIFIFGILLTSCFSRNGIESSSKKIKISMLVDGV^ 

YVSDLDNLKRNGSDLIWLVGYMLTDASLLVSSENPKI SYGI IDPI YGDDVQI PENLIAVVFRVEPRCFFGWLYCSQ 
KKLFWQNRFYRGNEGZ 

t542.aa 

CFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPEN^ 

LIWLVGYMLTDASLLVSSENPKI SYGI IDP I YGDDVQI PENL I AWFRVEPRC FFGWLYCSQKKLFWQNRFYRGNE 
GZ 

f542.nt 

ATGAGAATTGTAATTTTTATATTCGGTATTTTC 

AAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACGACA 

ATTACGCTTGAAAAAAGATTTTCCAGAAAATATTGAAG^ 

TATGTTTCAGATCTTGATAATTTAAAAAGGAATGG 

CATCTTTATTGGTTTCATCGGAGAATCCAAAAAT^ 
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GATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTA 
AAA^GCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAAGGGT^ 

t542.nt 

TGCTTTAGTAGAAATGGAATAGAATCTAGTTCAAAAAAAATTAAGATATCCA 

ACAAATCTTTTAATTCTAGTGCTAATGAGGCTTTAT^ 

TTTTTCTTGTGCTATTTCTGGAGTTTATTCTAG 

TTGATTTGGCTTGTAGGGTACATGCTTACTC^ 

GAATAATAGATCCCATTTATGGTGATGATGTTCAGATTCCTGAA 

AAGGTGCTTTTTTGGCTGGCTATATIKX^GCCAAAAA 

GGGTAA 

f93.aa 

MKRI LAMHDI SSMGRTSLTIC I PVX SSFNMQVCPFWAVLSASTAYKKFEI VDLTDHLEKFINIWKEQNEHFDILY 

TGFLGSEKQQITIEKIIKLIKFEKIVIDP^ 

NKDDIIKAILNLIXrKATVVVTSVKRGNLI^ 

LEKTTKAIHLI I KES I KENVSKKEGVRI ENFLKNTFZ 

t93.aa 

CIPVISSFNMQVCPFVTAVLSASTAYKKFEIVDLTDHLEKFINIWKEQNEHFDILYTGFI^ 
KFEKIVIDPVFADIXSEIYPIFDNKIISGFRKIIKYAN^ 
TSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTC^ 
SKKEGVRI ENFLKNTFZ 

f93.nt 

atgaaaagaattttagcaatgcatgatatttcaa 

cttcgtttaatatgcaagtttgtccttttc 

agtggatttaaccgatcatttagaaaaatttatc^ 

accggatttctgggaagcgaaaaacaacaaataacj^^ 

taattgatcctgtgtttgctgacgatggagaaatttaccctatat^ 

aatcataaagtacgcaaacataataacacccaatatcacagaacto 

aacaaagatgatatcataaaagcaatattaaatcttgatacaaa 

gaaatctcttgggaaacatttgctacaatcctaaaaacaaagaatactc 

aaatttcagtggaacaggagatttatttacga 

ttagaaaaaacaacaaaggctattcacctaataataaaagagtcaattaaag 

TCCGAATTGAAAATTTCTTAAAAAATACATTTTGA 
t93.nt 

TGCATACCAGTAATATCTTCGTTTAATATGCAAGTTTC 

ATAAAAAATTTGAAATAGTGGATTTAACCGATCATTTAGAAAAAT^ 

CTTTGACATACTCTATACCGGATTTCTGGGAAGCGAAAAAC 

AAATTTGAAAAAATTGTAATTGATCCTGTGTTTGCTGA 

TTAGTGGATTTAGAAAAATCATAAAGTACGCAAACAT 

AAGCTCAAAACTTAACAACAAAGATGATATCATAAAAGC^ 

ACAAGCGTTAAAAGGGGAAATCTCTTGGGAAACATTTG^ 

TAGAAGGATTAGAACAAAATTTCAGTGGAACAGGAGATTTATTTACC^ 

TGAAACAGAGGAAGCCTTAGAAAAAAGAACAAAGGCT 

TC^AAAAAGAAGGGGTCCGAATTGAAAATTTCTTA 



fl05.aa 
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MGLYLKLLRQSINLKSLFPLSVLFFSCNV I IKNLKN 

KNVLDLINNRVLFRAFKNAYFIDQGSGLSVSILSKRKINIKVLSVMQDSOT 
DFIKSIANLQISEQILYLKAQMDiQiMFILDESEFVIFDLLIKNGFFSLINDSNYTSML^ 
YSFMFVIADYLHSNYWENFPQKIVINZ 

tlOS.aa 

CNVVDTDFSVLEFKVANFNLNDDFSQGLLDSAYNILNRSFDLIIIKNLKN^ 

GLSVSILSKRKINIKVLSWQDSCDLKLGLLVDFKFENNHYGIVIYNLSKDFIKS 

FILDESEFVIFDLLIKNGFFSLINDSNYTSMLAN^ 

NZ 

flOS.nt 

ATGGGCTTGTATTTGAAGTTGTTGAGACAAAGTATCAACTTGAA 

CCKSTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGT^ 

TCAAGGGTTACTTGATTCTGCTTATAATATTCTAAA 

AAAAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTT 

GTGGCCTTTCTGTTAGCATTCTTTCTAA 

TTTAAAATTAGGATTCCTTGTGGATTTTAAATTTG^ 

GATTTTATTAAAAGTATTGCCAATTTGCAAATO^ 

TGTTTATTTTAGATGAATCTGAATTTGTTATTTTTC 

TTCAAACTACACTTCAATGTTAGCAAATAAAATTGATTm 

TATTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATC 

TCAATTGA 

tl05.nt 

TGTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGT^ 

AAGGGTTACTTGATTCTGCTTATAATATTCTAAATC^ 

AAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAG 

GGCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAA 

TAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCAC 

TTTTATTAAAAGTATTGCCAATTTGCAAATTAGTO 

TTTATTTTAGATGAATCTGAATTTGTTATTTTTG^ 

GAAAOTACACTTCAATGTTAGCAAATAAAATTGAT^ 

TTCATTTATGTTTGTAATTGCAGATTATTTC 

AATTGA 

fl50.aa 

MKTFVT IGLSNLGIHLLEDLSRLDCQI 1 1 1 DTSKELI EEYD VI STESFWEQFTKNALKRI I PVDTDAWIDFDDD 
LGKSALVTHYCNLLGLKEI CVKTENRDDAEI LKTLGATKI I FPSKDAARRLT PLLVS PNLSTYNI IGYDI IVAETV 
IPKEYVGKTLFEADLRRECGIWIAVRNLSNSRYEFVIXSDYFFLKDDKrVTC^ 
EDENLNKDAEKKSRFLGI FNFMKI FQKDRKDNZ 

t!50.aa 

CQI I IIDTSKELIEEYDVI STESFWEQFTKNALKRI IPVDTDAVVIDFDDDIX3KSALVTHYCNLU3LKEICVKTE 
NRDDAEI LKTLGATKI I FPSKDAARRLTPLLVS PNLSTYNI IGYDI I VAETVI PKEYVGKTLFEADLRRECGITVI 
AVRNLSNSRYEFVDGDYFFLKDDKIVICGKPDS IENFTNNKDLIKDLI SGSKEDENLNKDAEKKSRFLGIFNFMKI 
FQKDRKDNZ 



f!50.nt 
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ATGAAAAGATTTGTTATTATTGGACTTAGT^ 

AAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCT^ 

GCAATTCACTAAAAATGCTTTGAAAAGAATAATTC 

CTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATOT 

GAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTA^ 

AACTCCATTATTAGTATCTCCAAATCTTTCAACTTATAATATTATO 

ATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAG 

TTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAA 

GAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGA 

AAAAAGATCGTAAGGATAATTAG 

tl50.nt 

TGTCAAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATG 

TTGAGCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGA 

TGATCTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTT^ 

AATAGAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTC 

GATTAACTCCATTATTAGTATCTCCAAATCTCT 

TGTTATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATC 

GCTGTTAGAAATTTAAGTAATTCTAGGTATGAATTTGT^ 

TAATTTGTGGTAAACCAGATAGCATTGAAAATTT^ 

TAAAGAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATC^ 

TTTCAAAAAGATCGTAAGGATAATTAG 

f219.aa 

MLIARIMNINTLFYGMIIIIFALISCNHKNIQYDKRIK^ IRSRL 

NSYKNSKIREIFGIVKVFDINTPKIKEISDSLMSDSYNNRWGSWEIIHNAERGINSLVYIV^ 

IDEIASTISIFKKIITTNNENIDNNEENNNTN^ 

t219.aa 

CNHKNIQYDKRIKKFIJ3KNKIEYKIDSENDFI1AFKDINN1^^ 
KEISDSLMSDSYNNRVFGSWEIIHNAERGINSLWIVKAEEFANDT 
EENNNTNESNEQPTLKQEKTNSTKESNNELKEDQI EEELQEI KAQZ 

f219.nt 

ATGCTAATTGCAAGAATAATGAATATTAATACATTATTCTACGGCATG^ 

GGAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTT^ 

CTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGA 

AACTCATATAAAAATTGAAAGATAAGAGAAATATTTGGAATT 

AAGAAATATCTGACTCGCTTATGAGCGATAGTT 

AGAAAGAGGAATCAACTCTTTGGTATATATTC^ 

ATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAA 

AAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAA 

TAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATC 

t219.nt 

TGCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTT 

ACTCAGAAAATGACTTTATAGCATTTAAAGATATAAAGAATAACGAAAA 

AAACTCATATAAAAATTCAAAGATAAGAGAAATATTTC 

AAAGAAATATCTGACTCGCTTATGAGCGATAGTTATAATAACAGAGTA 

CAGAAAGAGGAATCAACTCTTTGGTATATATTC 
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AATTGATGAGATTGCCTCAACAATAAGTA 

GAAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAAT^ 
CTAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

£229. aa 

MRVDLLPLVELSLYINLSFCCKDFSIFNRILEELKCHLILIX3HPIIKTLYIKHVDFCLSRQDNLKFIFTS 

LELLEEFTLEIIPGYVDFEKFKLLDEFC^ 

VNMPLQKKSHLKRDLQRIAFIYAZ 

t229.aa 

CKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCL^ 
FKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKI^^ 
IYAZ 

f229.nt 

ATGAGAGTAGATCTTTTACCTCTTGTCGAGTTAAGTCTTTATATTAAT^ 

TTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATC 

TAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATl^ 

TTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTC 

AATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTT^ 

AATTTCTTATAAAAAATTGAATATTTTGATTAACAATAOT 

GICAATATGCCTTTGCAAAAAAAATCTC^TCTCAAGCG^ 

t229.nt 

TGTAAAGATTTTAGCATTTTTAATAGAATTTTAGAGGAATO 

TAAAAACACTTTACATTAAGCACGTAGATTTTTGTTTATCTAGG^ 

GTCCAAGTATATTAATTTGGAGTTATTAGAAGAATTTACTTO 

TTCAAACTTTTGGATGAATTTTGTATTACTAGAATTAATC 

TTGTGGGGATACCCGAAATTTCTTATAAAAAATT^ 

GAATATTGACATGACTGTCAATATGCCTTTGCAAAAAAAATC 

ATATATGCCTGA 

f22.aa 

MLKTLTKIITISCLIVGCASLPYTPPKQNLNYLMELLPGANLYAHVNLIK^ 

KKENNDFALL IMGNFPKDI FWG I HKNRNTES I GNI FTNPKWKLKNSNI Y 1 1 PNKARTS IAITQKDI TAKDNNMLTT 

KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSS^ 

TNMTNLTISSHIKTTIKIX2NTVEIEFNIQKSSVESLIEKLASNIQT 

t22.aa 

CASLPYTPPKQNLNYLMELLPGANLYAHVNLIKNRSIYNSLSPKYKSVLG 
DIFWGIHKNRNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIM 

PTLI^PNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
DQNTVEIEFNIQKSSVESLIEKLASNIQT 

f22.nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTC^ 

CAAAAGAAAATCTAAATTACTTAATGGAACTTTO 

GAGGTCTATTTATAACTCTTTAAGCCCTAAAT^ 

AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGA 
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ATAGAAATACAGAATCAATAGGCAATATATT^ 

TCCAAACAAAGCTAGAACTAGCATTGCAATAACC 

AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGA 

TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTC 

TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAA 

ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TTAATATTCAAAAATCTAGTGTTGAAAGCCTTAT^ 

t22.nt 

TGCGCAAGCCTGCCTTACACTCCTCCAAAACAAAATCT^ 

ACGCCCATGTAAATTTAATTAAAAACAGGTCTATTTATAACTCTTT 

TATAAGCAATTTATACTTTAGCTATAAAAAAGAAAATAAC 

GATATTTTCTGGGGAATTCATAAAAATAGAAATACAGAATCAAT^ 

TTAAAAATTCAAATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAA^ 

AAAAGACAATAATATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAA^ 

CCAACATTATTGCTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTA 

ACAGCTTAAATCAAGAAGAATATATTTTTAAATCC^ 

AAAAAAGTTAATTCCAACCGTCTTGACAAAGATGACAAACCTCACAAT^ 

GACCAAAATACGGTTGA7WVTAGAATTTAATATTCAAAAATCTAGTC 

ATATTCAAACCTAA 

£32. aa 

MtnTCTLYLISLILLACNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLS 
KIEKTLEKTERYGI EGNWILVNYKGTKRYIFSKDINI VNNL I IDHSK 

t32.aa 

CNKNNKIPLIQKIJDLPKSSILjGFSNKMGIIIKDYAFLSKST^ 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSK 

f32.nt 

ATGAATACAAAAACATTATATTTAATATCCTT^ 

AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGC^^ 

TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGC^ 

AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAA 

OTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTO 

t32.nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGAT^ 

AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACT 

CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAAC^ 

GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTA 

TAATAATTGATCATTCTAAATAG 

fl86.aa 

MKKLI 1 1 FTLFLSQACNLSTMHKIOTKEDMKILYSEIAE 
FGTTPMQRIHKYDQSFNLTRE I LASGI ELNRVVNAWLNSPSHKEALINTDTD^ 
YKN 



tl86.aa 
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O^STMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDOT 
PMQRIHKYIX2SFNLTREILASGIELNRVV^ 

fl86.nt 

ATGAAAAAATTGATTATAATTTTTACACTGTTTTTATCTC 
CAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAA 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGA 
TTTGGCACAACCCCAATGCAAAGAATACATAAATAC^ 

GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAA 

CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGTTC 

TATAAGAATTGA 

tl86.nt 

TGCAATTTAAGTACAATGCATAAAATAGATACAAAAGAAGATATGAAAATTCTA 
GAAAAAAATTAAATCTAAACCATCTAGAAATAGATGATACCCTTGAAAAAGTT^^ 
GGGAGAAAATAGAACAATAACTCACACCCTTTTTG^ 
TTTAATTTAACAAGAGAAATACTGGCATCAGGAATTGAACTTAACAGA 

GCCACAAAGAAGCTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAACGACTC 
TATATTTGTAGTTCTTTTTGGAAAAAGAAAATATAAGAATTGA 

f216.aa 

MIRVLLGSLAVS FLFSICMVFLNYDNLFSKKVFYFHS SKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFG 

FLLSDSRFLYSFLKNGVYYVNLSREFYDSFNNGDYN^ 

LKEQS 

t216.aa 

CMWIiNYDNLFSKKVFYFHSSKGFVANLR^ 

YYVNLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVT 
f216,nt 

ATGATTAGGGTGCTTTTGGGGTCTTTGGCAGTAAGC 

ATC M l"rri"ri^AAAAAAGGl"l"ITTTATTTTCATTC 

ACAAAATTTGAAAGATAATTTAGATCTTTT^ 

TTTTTATTAAGTGATTCAAGATTTTTATATTC 

TTTATGATTCTTTTAATAATGGTGATO^ 

GTCTTTAATAAAAACAATGCGCTTTAACTATC 

TTAAAGGAGCAAAGTTGA 

t216.nt 

TGTATGGTTTTTTTAAATTATGATAATCTO 

CTAATTTAAGATATTTAAGAGATGAACAAAATTTGAAAGA 

AAGCAATGAAGGGTTTTCTTTTGGGTTTTTATTAAG 

TATTATGTAAATCTTTCAAGAGAATTTTATGATTCT^ 

ATGTTAAGGTCAATCTTTTTGCTATGTCTT^ 

TGTTATTCTTGTTGAAGGGTGTATCTTAAAGGAGCA 

f328.aa 

MAIKYARENNIPFIXalCLGLQI^VIEFARNVCGILDAOTE 

VILKKNTIAFKLYGQDRI IERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEI PENKFFVACQFHPELITR 
IENPAKLFLGLI KAC I 
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t328.aa 

CliGLQLAVI EFARNVCGI LDAOTEENLARDKPLKSPVIHLLPEQ 

RIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEIPENKFFVACQFH^ 

LGLIKACI 

f328.nt 

ATGGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTC^^ 

AATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATAC 

TCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAAT^ 

GTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGC 

ATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGOT 

AATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCC 

ATAGAAAATCCAGCCAAGCTTTTTCTAGGATTA 

t328.nt 

TGTCTTGGTTTGCAGCTTGCTGTAATAGAATTTGCTC 

ATTTAGCAAGAGACAAGCCCTTAAAAAGTCCTGTTATCCATTTACTTC 

CGCTACAATGAGGCTTGGTGGATATCCTGTGATTCTTAAAT^ 

CGGATAATTGAAAGATTTAGACATAGGTATGAAGTCAATAATGATTAT 

TAGTATCTGGATTTTCAAGTGATTTTAAAATGGCAAAATTAAT 

CCAGTTTCATCCAGAACTTATTACAAGAATAGAAAATC^ 

TGA 

f352.aa 

MNKTKimSLTYFI ILSCI SLFGANNNTI 

KLPENIRDKKL PQKRMDENDLKSVT ENYENKI KNI EKLLKTKNQKTS ENENKKI ES I EKKAKKYE I LTNKLKNE IV 
EIKKLLNKKIKPKEDENYEKINIENI EEETDDDFEDNYEYNDEIEXTNEDNYPSNEGI INNLKENLNENEKYYAIN 
EKKIDELEDRINENENTI LDLQRELRNFKKKDNSDKNLEEI EENLSS IGRI INDLKRKI SANEAINKENQKKIRTD 
KHKLKELEDKIKENEETI LKLQKELNNFKKKEIYQKPLNEETFTPS ITSKNDDLEENKKLKKEYLKPIEKKESRDL 
EENTKSTPKTTMIKTADFQIYPDIYLNNYKFKEKGD^^ 
NPILKNKEEFFRNLIEVKNIHELGIMYKNLKPEFKQIKIIK 

t352.aa 

CISLFGANNNTISYSSIEIPLEDLSEEFTtSSGNKSIX}INTS 
DENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNK^ 

NYEKINIENIEEETDDDFEDNYEYNDEIEXTNEDNYPSNEGI INNLKENLNENEKYYAINEKKIDELEDRINENEN 
TILDLQRELRNFK3O0DNSDKNLEEIEENLSSIGRI INDLKRKI SANEAINKENQKKIRTDKHKLKELEDKIKENEE 
TI LKLQKELNNFKKKEIYQKPLNEETFTPS ITSKNDDT■lEK^^CKLKKEYLKPIEKKESRDLEENTKSTPKTTMI KTA 
DFQIYPDIYLNNYKFKEKGDQFAFTOCENTYYIEIDPTNNLNEALK^ I SKYKFEKYFINPILKNKEEFFRNLIE 
VKNIHELGIMYKNLKPEFKQIKIIK 

f352.nt 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTOT 

ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGAC^ 

CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCT^ 

AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATC^ 

AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAA 

TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTTAA 

GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAAAATO 
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TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGA^ 

TACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAATGAAAACGA 

AAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACACTATTTT 

GAATTTTAAAAAAAAAGATAACTC&GATAAAAACTTAGAAGAAAT^ 

ATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAATAAACAAAGAAAATC 

AACACAAACTCAAAGAATTAGAAGATAAAATAAAGGAAAATGAAGAGACTATTTO 

CAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGA^ 

GACGACTTAGAAGAAAATAAGAAATTAAAAAAGGAATATTTAA^ 

AAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAGATTTTCAAATC 
TAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAAAG 
CCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAA 
ACCCTATTCTAAAAAATAAAGAAGAATTT^^ 

GTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATAA 
t352.nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTAC 
AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAA 

TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCC 

GACGAAAATGATCTAAAATCTGTAATTGAAAATTATGAAAATAAAATTA^ 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAA 

AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTT^ 

AATTACX3AAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGA 

ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATG^ 

GAAAACGAAAAATATTATGCTATTAATGAAAAAAAAATCGATGAAC 

CTATTTTAGACTTGCAAAGAGAATTAAGGAATTTTAAA 

GGAAAATTTATCTTCAATAGGAAGAATAATTAATGATCTAAAAAGAAAAA 

GAAAATCAAAAAAAAATAAGAACTGATAAACACAAACTCAAAGAATTAGAAGATAAAAT 

CTATTTTAAAACTTCAAAAAGAATTAAACAATTTTAAAAA 

TTTCACTCCAAGCATTACAAGTAAAAATGACGACTTAGAAGAAAATAAGAAATTAA 

ATAGAAAAAAAAGAAAGCCGAGATCTAGAAGAAAATACTAAAAGCACCCCAAAAACAACTATGATA 

ATTTTCAAATCTACCCTGACATATATCTTAATAATTATAAATTTAAAGAA^ 

AGAAAACACATACTATATTGAAATAGATCCCACTAACAATTTAAATGAGGC^ 

AAATATAAATTTGAAAAATATTTCATTAACCCTATTCT 

TCAAAAATATCCACGAACTAGGAATTATGTATAAAAATCTAAAGCCTGAATTO 
A 

f867.aa 

MNTKGKWG^/NGNLVTIEVEGSVSMNEVLFVKTAGRNLKAE SVGDLVEFTDKLLTV 

EliGPGLLTQVYDGLQNPLPEIAIQCGFFLERG^ryiiRPLNro 

PFYKRDSYKIVEIVSIX3DYSIDEQIAVIEDDSGM^ 

AKGGTFC I PGPFGAGKTVLQQVTSRNADVDWI IAACX3ERAGEVVETLKBFPELMDPKTGKSLMDRTCI ICNTSSM 
FVAAREASVYTAITIGEYYRQMGLDI LLLADSTSRWAQAMREMSGRLEEX PGEEAF PAYLESVIASFYERAGI WL 
NNGDI GSVTVGG SVS PAGGNFEE PVTQATLKWGAFHGLTRERSDARKF PAI S PLESWSKYKGVT DQKKTEYARS F 
LVKGNEINQMMKWGEEG ISNDDFL IYLKSELLDSCYLQQNSFDS IDAAVSSERQNYMFDI VYNI LKTNFEFSDKL 
QARDFINELRQNLLDMNLSSFKDHKFNKLEHALGELINFKKVI 

t867.aa 

GRNLKAEVI RI RGNEVDAQVFELTKG I SVGDLVT2FTDKLLTVELGPGLLTQVYDGLQNPLPELAI QCGF FLERGVY 
IJlPIiNKDKKWNFKKTSK\7GDIVIAGDFLGFVIEGTVHHQI^ 

RHNITMSFHWPVKVPITNYKERLIPSEPMLTQTRI IDTFFPVAKGGTFCI PGPFGAGKTVLQQVTSRNADVDWI I 
AATCERAGEWETLKEFPELMDPKTGKSLMDRTCIICNTSSMPVAAREA 

RWAQAMREMSGRLEEI PGEEAFPAYLESVIASFTERAGIVVLNNGDIGSVWGGSVSPAGGNFEEPVT^ 
AFHGLTRERSDARKFPAI S PLESWSKYKG^II^KKTEYARSFLVKGNEINQMMKVVGEEGI SNDDFLI YLKSELLD 
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SCTLQQNSFDSIDAAVSSERQimiFDIVYNI 
ELINFKKVI 

f867.nt 



ATGAATACAAAAGGAAAAGTCGTTGGAGTTAATGGAAACTTAGTTACTATTC 

ATGAAGTTTTATTTGTAAAGACTGCTGGTAGGAATTTAAAAGCAGAAGTAA 

TGCACAGGTTTTTGAATTGACAAAAGGGATATC 

GAACTCGGACCAGGGCTTTTAACTCAAGTATATGATGGGCTTCAAAATC^ 

GATTTTTTTTAGAAAGGGGAGTATATTTAAGGCCCTTGAATAAAGA 

AGTTGGAGATATCGTTATTGCAGGAGATTTTTTAGGTTTTC 

CCATTTTATAAAAGGGATTOTTATAAAATTGTGGAGATTGTAAGTG 

CTGTAATTGAAGATGATTCTGGTATGAGGCATAAT^ 

TAATTATAAGGAACGCCTTATTCCTAGTGAACCTATGTTGACTCAAACTAG 

GCCAAAGGTGGAACTTTTTGCATTCCGGGTCCTT^^ 

ATGCTGATGTTGATGTAGTGATTATTGCAGCTTGTG^ 

TCCCGAATTAATGGATCCAAAAACCGGCAAATCTTTAATGGACAGGA 

CCAGTTGCAGCTAGAGAAGCTTCTGTTTATACTGCTA 

TTCTTCTTTTGGCAGATTCAACTTCAAGATGGG 

TGGCGAGGAGGCTTTTCCGGCATATCTTGAGTCTGTTATTGCTTC 

AATAATGGGGATATTGGATCTGTAACAGTTGGTGGCTCTGTAAGTC 

CTCAAGCAACTTTAAAAGTTGTAGGAGCATTTCACGGGCTTACAAGAGAAAGGTC 

TATTAGTCCTCTTGAATCTTGGAGTAAATATAAAGGCGT^^ 

TTGGTGAAAGGTAATGAAATTAATCAAATGATGAAAGTTGTTGGAGAA 

TTTATTTAAAATCCGAGCTACTTGATTCGTGCTATTTGCAGCAA 

TTCAGAGCGTCAAAATTATATGTTTGATATAGTTTATAAGATTCTTA 

CAAGCAAGAGATTTTATAAATGAGTTAAGGCAAAATCTTTTAGACATC 

TTAATAAATTGGAGCATGCTTTGGGTGAATTGAT^ 

t867.nt 

GGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTG 
GGAT 

ATCTGTTGGAGACCTAGTTGAATTTAGAGACA^ 

TATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTC 

GGCCCTTGAATAAAGATAAAAAGTGGAATTTTAA 

TTTAGGTTTTGTAATTGAGGGAACTGTT^ 

GTGGAGATTGTAAGTGATGGCGACTATTCX^TTGATGAGCAA^ 

ATAATATTACAATGTCTTTTCATTGGCC 

ACCTATGTTGACTCAAACTAGAATTATAGATACATTTTTCC^ 
CCTTTTGGAGCAGGAAAAACGGTTCTTC^GCAGG 
CTTGTGGTGAGCGAGCAGGAGAAGTGGTAGAAACTCTTAAAGAA 
ATCTTTAATGGACAGGACTTGTATTATTTC 

ACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTOGATATTCTTC 

GGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCCTG 

GTCTGTTATTGCTTCCTTTTATGAAAGGGCAGGTATTGTA 

GGTGGCTCTGTAAGTCCTGCTGGTGGTAATTTTGAAGA 

TTCACXXXXTTTACAAGAGAAAGGTCTGATGCTAGGAAATTTCCA 

TAAAGGCGTTATTGATCAAAAAAAGACTGAATATGCAA 

ATGAAAGTTGTTGGAGAAGAAGGCATAAGTAACGATGATTTTTTAA 

GCTATTTGCAGCAAAATTCATTTGATTCTATTGATC 

AGTTTATAAC^TTCTTAAAACTAACTTTGAGTTTTCTGA 

CAAAATCTTTTAGACATGAATCTTTCTTCTTTTA 

TGATAAATTTTAAAAAGGTAATTTAG 
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f868.aa 

MKRVYSKIES IAGNVITVTAQGIKYGELAIVKAKDTSSLAEVI KLDREKVSLQVYGGTRGV STSDEIKFLGHSMQV 

SFSDNIJjGRIFIXSSGNPRDGGPSLDDNLIEIGGPSANPTKRIVPRNMIRTC 

YNEIJ^IRIALQAEVDLIILGGMGLKHDDYLTFKDSLEKGGALSRAI 

GKKVLVLLTDMTNFADAMKE I S ITMEQVPSNRGYPGDLYSQLAYRYEKAI DFEGAGS I TI IiAVTTMPGDDVTH PVP 
DNTGYITEGQYYLKGGRI EPFGSLSRLKQMVNSRTRDDHRTIMDSMIKLYAS SKESVEKKAMGFNMTKWDEKLLKY 
SNMFESKMMDLSWIPLEEALDLGWSIIASCFSPKETGIKTDLIEKTO 

t868.aa 

QGIKYGELAIVKAKDTSSLAEVIKLDREKVSI^W 
GPSLDDNLIEIGGPSANPTKRIVPR1TOIRTGLP 

GMGLKHDDYLTFKDSLEKGGALSRAI FFVHTANDSWESLTVPDI SLSVAEKFALKGKKVLVLLTDMTNFADAMKE 
I S I TMEQVPSNRGY PGDLY SQLAYRYEKAI DFEGAG S I T I LAVTTMPGDDVTHP VPDNTG Y I TEGQYYLKGGRI EP 
FGSLSRLKQMVNSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKLLKYSNMFESK^ 
LDLGWSILASCFSPKETGIKTDLIEKYWPKKETY 

£868. nt 

ATGAAAAGAGTCTATAGTAAAATAGAGTCTATAGCAGGCAATGT^ 

GTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTC 

TCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTC 

TCATTTTCTGACAATTTGTTGGGCAGAATT^ 

ATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATC 

GCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATC 

TATAATGAGCTTCTTATAAGAATTGCACTTO 

ATGATGATTATTTAACTTTTAAAGATTC 

TGCTAATGATTCTGTTGTTGAATCTTTAACTGTTC 

GGCAAAAAAGTTTTGGTGCTTCTCAGAGAC^ 

AACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGT^ 

CTTTGAAGGCGCAGGATCAATTACAATACTTGCAGT^ 

GACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAA^ 

GACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCAC^ 

ATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGAT^ 

AGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTA^ 

GCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGC^ 

AGAGACTTATTGA 

t868.nt 

CAAGGTATTAAGTATGGTGAGCTTGCTATTGTAAAAGCAAAAGATACA 
ATCGAGAAAAAGTTTCTCTTCAGGTTTATGGTGGTACAAG 
GCATTCAATGCAGGTTTGATTTTC 
GGCCCTTCTCTTGATGATAATTTGA^ 

ATATGATAAGGACAGGGCTTCCAATGATAGATGTTTTTAATACTCTTC 

TGTTTCTGGTGAGCCTTATAATGAGCTTCTTATAAGAATT^ 

GGAATGGGACTTAAGCATGATGATTATTTAACTTTTAAAGA 

TTTTTTTTGTTCATACTGCTAATGATTCTC 

AAAGTTTGCTCTAAAGGGCAAAAAAGTTTTGGTGCTTC 

ATATCTATTACAATGGAACAAGTGCCTTOTAATAGAG 

ATGAGAAGGCTATTGACTTTGAAGGCGCAGGATCAATTACAATA 

TACTCATCCTGTTCCTGACAATACTGGATACATTACAGAAGGTCA^ 

TTTGGGTCTCTTTCAAGACTTAA 

TGATCAAGCTTTATGCATCTTC^U^AAGAGTCTC 

AAAATTGCTGAAGTATAGCAATATGTTTGAAAGTAAGA 
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TTAGATTTAGGTTGGAGCATTCTTGCTAGTTGTTTT 
AATATTGGCCTAAAAAAGAGACTTATTGA 

f872.aa 

MRSAVLFFFALPFSISLYSSSNKNFPYVTELLEKGRQFLYSKSEFSKSNL^ 
YGMSGNAILEKLNLYKSFEDRYYLIJJESFEKKILFSLAKMAELENNYV^ 

ENSMSNNELNASF YLTSYLKQVRGAFG I DFTFNLYRFKNYNVT DTHQLLSKVYLHLKAYELS I THGLI AAVG I LTR 
MYDYVCYYEPVYQFKNLRSFVQKINKYKAI KNAFE STDFWEI VYNVAAATYAYSNGNYKFRAI DTWKL WDLAPRF 
SPYIAKSRSQIKNSVYLKKN 

t872.aa 

SNKNFPYWI LLEKGRQFLYSKSEFSKSNLTHAINYI^EIALIjRKGVYPEAS YYLSVAYGMSGNAI LEKLNLYKSFED 
RYYLLDESFEKKILFSLAKMAELENNYViyriDYLNDILN^ 
QWGAFGIDFTFNLYRFKNYNVIDTHQLLSKVYTiH^ 
VQKINKYKAIKNAFESTDFWEIVYNVAAATYAYSNQmCFRAIDTWKL 

f872.nt 

ATGAGAAGTGCGGTTTTATTTTTTTTTC 

CGTATTGGATTTTACTTGAAAAAGGCAGGCAATTTC 

TGCTATTAATTATTTGGAGGAAGCTTTGCTTAG^ 

TATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCT^ 

ATGAATCTTTTGAAAAAAAAATACTTTTTTCT^ 

TGATTATTTGAATGACATATTAAATAAGTTTTCAAC 

GAAAACAGTATGTCAAATAATGAACTTAATGCTTCATTTTA 

TTGGTATTGATTTTACTTTTAATCTTTAC^ 

AGTTTATTTGCACTTAAAAGCTTATGAGCTTTCAATTACTC 

ATGTATGATTATGTTTGTTATTATGAACCTGTGTATCAGTTTAA 

AGTATAAGGCAATAAAAAATGCTTTTGAATCTA 

TGCATATTCTAATGGCAATTATAAATTTAGAGCAATAGATACTTGGA 

TCTCCTTATATTGCTAAATCAAGAAGTCAAATO 

t872.nt 

AGTAATAAAAATTTTCCGTATTGGATTTTAOTTG 

AGTCTAATCTTACACATGCTATTAATTATTTGCAGGAAG^ 

TTATTTGTCAGTAGCTTATGGTATGTCTGGCAATGCTATTC^ 

AGATATTATTTGCTAGATGAATCTTTTGAAAAAAAAA 

ATTATGTTGATACTATTGATTATTTGAATGACATACT 

TGATTATTCTCAAGGCGAAAACAGTATGTCAAATAATGAA^ 

CAAGTAAGAGGAGCTTTTGGTATTGATTTTACTTTTAATC 

ATCAATTATTGTCAAAAGTTTATTTGCACTTAAAAC^ 

AGGAATTTTAACAAGAATGTATGATTATGTTTGTTATTATG^ 

GTTCAAAAAATTAATAAGTATAAGGCAATAAAAAATGCTTTTGi^ 

TTGCTGCTGCTACTTATGCATATTCTAATGGCAATTATAAAT^ 

TCTTGCGCCAAGGTTTTCTCCTTATATTGCTAAATC 

TAA 

f874.aa 

MLKSNKWLIGAGGVGSSFAYALTIDNSLVHELVI IDVNENKAKGEVMDLNHGQMFLKKNINVLFGTYKDCANADI 
WITAGI^QKPGETRLDLVDKNSKIFKDII 

SRLRYFLSDHFNVNTQNIHSYIMGEHXDS SFATWDETKI AMKPLSEYLAEGKITEI^LDEIHKKWNAAYEVI KLK 
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GATYYAIGLGIKNIVNAIIGDQNVIL^ 
NQLKSYIDKMEF 

t874.aa 

ALTIDNSLVHELVIIDVNEfcHCAKGEVMD^ 

NSKIFKDIITNWSSGFIX5IFWASNPVDIMTYVT^ 

IMGEHXDSSFATWDETKIAMKPLSEYLAEGKITELELDEI 

QNVILPI SSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKI SPKELDKFNSSANQLKSYIDKMEF 
f874.nt 

ATGCTTAAGTCTAATAAAGTTGTTCTTATTGGAGCTGGTGGGG 

AGAATTCACTTGTACATGAACTTGTAATTATTGATGTO 

TCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTT^ 

GTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTC 

TTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGG 

TATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTC 

TCAAGACTTAGATATTTTTTAAGTGATCATTTTAATC 

ACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAA 

AAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGT^ 

GGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATC 

GCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAA 

TGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTA^ 

ATCAGCTTAAAAGCTATATTGATAAAATGGAATTTTAG 

t874.nt 

GCTTTAACAATAGACAATTCACTTGTACATGAACTTGTAATTAT^ 

TCATGGACCTTAATCATGGCCAAATGTTTTTAAAGAAGAATATT 

TAATGCAGATATTGTTGTAATTACAGCAGGACTTAATCA 

AATTCTAAAATTTTTAAAGATATTATAACTAATGTTGTATCTAGCGGT^ 

ATCCTGTAGACATTATGACTTATGTTACAATGAAATATTCC^ 

TATTCTTGATACTTGAAGACTTAGAT^ 

ATTATGGGTGAGCACGTGACAGTTCTTTTGCT^ 

TCTTGCTGAAGGCAAAATAACTGAGTTGGAGCTT^ 

ATTAAGTTAAAGGGGGCAACCTATTATGCTATTGGACT^ 

AGAATGTTATTCTGCCAATATCTTCTTATATC 

GCCTGCTATAGTTTGTAAGGAAGGAGTCAAAGAAGTTTO 

AATAGTTCTGCTAATCAGCTTAAAAGCTATATT^ 

£886. aa 

MKIOCQLILLLFMPQIIYAKSYFASDWFNKYQKLNEKPKTGFYIEYYSVDOT 
KITCYiyTKDTKRKEEIYDNLNNKIQEIEYDSKGKTLCT 

TGSNFQIWOTGINGDIKSTYFDIKKATTKVIKYDDKKRNSNSTI IVNNKIKSKEKNQYLDEEKIVNTFEEENTKI I 
STYKANNLIKEETYKNNELIKVNDFQYNESDMI IFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDI IYLKT 
EYHNDNEYEEEIYYNKKPALRVKHKNGKVTEEKPIGTN 

t886.aa 

SYFASDVFFNKYQKLNEKPKTGFYIEYYSVDDTEKLYLYKENNLIKYKTIQI 

LNNKIQEI EYDSKGKTLETANYVYENENLI SKNLKTINQKPKLI YYSKDDNGKLLKITGSNFQIWNYGINGDIKST 
YFDI KKATTKVI KYDDKKRNSNST I IVNNKI KSKEKNQYLDEEK I VNTFEEENTKI I STYKANNL I KEETYKNNEL 
IKVNDFQYNESDMI IFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKI YENDI IYLKTEYHNDNEYEEEI YYNKKPA 
LRVKHKNGKVTEEKPIGTN 
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TABLE 1. Nucleotide and Amino Acid Sequences 
f886.nt 

ATGAAAAAAAAACAATTAATACTTCT 

TATTTTTCAATAAATACCAAAAATTAAATGAAAAACCAAAAACGGG^ 

TACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCAAATC 

AAAATTACATGTTATGATACAAAAGATACAAAAAG 

AAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATT^ 

AAATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAATGG 

ACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAAT^ 

CAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTA^ 

ATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAATAGTA 

TCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTTATAAAAGTAAATG 

AATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACC^ 

TGAATACGAATATAACAAAGAGAATCAATTAAAAAGCAAAAAAA 

GAATACCACAATGACAATGAATATGAAGAAGAAATATACTACAATAAAAAACCTGCTC 
ACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA 

t886.nt 

AGCTATTTTGCATCTGATGTATTTTTCAATAAATACC^ 
AGTATTATTCTGTTGATGATACTGAAAAACTCTAC 

AATCATAGAAAACACAAAAAAAATTACATGTTATGATACAAAAGATACAAA7UVGAAAAGAAG 

TTAAATAACAAAATACAAGAAATTGAATATGATAGCAAAGG 

ACGAAAACTTAATATCTAAAAATTTAAAAACAATAAAC^ 

TGGTAAATTACTAAAAATAACAGGATCAAATTTCCAAATTT^^ 

TATTTTGACATCAAAAAAGCAACAACAAAAGTTATAAAATATGATGATAAAAA^ 

TTGTTAATAATAAAATAAAATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAAT^ 

AGAGAATACAAAAATCATATCTACCTACAAGGCAAACAACCT 

ATAAAAGTAAATGATTTTCAATACAACGAATCTGATATGATAATTTTTCAi^ 

AATACACCAATACTAAAATTGAATACGAATATAACAAAGACAATCAATTAAAAAG 

TATAATTTATCTAAAAACTGAATACCACAATGACAATGAATATGAAGAAGAAATATACTA^ 

CTTAGGGTAAAACACAAGAACGGAAAAGTCACCGAAGAAAAACCAATAGGA^ 

f888.aa 

MEKLKLKLAIPLLVFTICKIHSQSNIEYNFSYI INTKKENIDLKKGIEKQLDKI YDKITEHIVNNDDKSI IEDI YI 

NQDIIKTELEISKLKKEMDKKKLQNIITAKEKHNTKTKIDELKKNIQNINNKQKOT 

TNISNLNKEFFIRKELFFINYIDLKKIENY^ 

LKININVKNNSDAKI YINEKFVSKGI YHDNI FDI SKLPNKEI EIQITSANFENYS IKRTVKNADS I ILDIDLKRTI 
SKKVSIKSNVQSKVFKKGI FMGETPI EIEKPENQDI I LLKSKGYKDKFKLINKEEDQVEI EMIKTNKNRLIDTRDK 
FYVNLAVFTLSTIGAI FAGTLLNNSEVLYKITGNHFINKRLTAEDVYMAKAEQMTATFLFGV 
HLVEY I KEANMGE 

t888.aa 

SNIEYNFSYIIOTKKENIDLKKGIEKQIJ^KIYDKITEHIVNNDDKSIIEDIYINQDIIKTE 

QNI ITAKEKHNTKTKI DELKKNIQNINNKQKKF AEYFNNLKKLKVKYKK I EEQTNI SNLNK EFF I REELFF INY I D 

LKKIENYYLLEISNITPEKIETKKAVFKTSSSVNEIM 

KGIYHDNIFT)ISKLPNKEIEIQITSANFEtTYSIKRTVKNADS 

TPI EI EKPENQDI I LLKSKG YKDKFKL INKEEDQVEI EMI KTNKNRL I DTRDKFYVNLiAVFTLST IG AI FAGTLLN 
NSEVLYKITGNHFINKRLTAEDVYMAKAEQMTATFIJ^GVGITLTIGSFISLITHLVEYIKEANM 



f888.nt 
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ATGGAAAAGCTTAAACTAAAGCTAG 
TTGAATACAATTTTTCCTATATCATTAATAC^^ 

GGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGATC 

AATCAAGATATAATAAAAACAGAACTTGAAATTAGCAAATTAAAAAAAGAAATGGA 

TAATAACCGCAAAAGAAAAGCATAACACCAAAA(X!AAAA 

TAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTA 

ACAAATATATCAAATTTAAATAAAGAATTTTTTATAAGAGAAGAAT^ 

AAATAGAAAATTATTATTTGCTAGAAATTAGCAACATC 

AACATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCC 

TTAAAAATGAACATTAACGTCAAAAATAACTCGGATGCAAAAATC 

TCTATCACGATAATATTTTTGACATTTCTAAACTC 

OSAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGACTCAATAATATTAGA 

TCTAAAAAAGTATCAATTAAAAGCAATGTACAATCTAAAGTTTTTAAA 

TTGAAATTGAAAAACCAGAAAATCAAGATATCATCT^ 

AAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGACTTATCGACA 

TTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCC^ 

AAGTACTTTATAAAATAACAGGCAATCACTTTATTAACAAA^ 

GGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATGA 

CATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

t888.iit 

AGTAATATTGAATACAATTTTTCCTATATCATT^ 
AACAATTGGACAAAATCTATGATAAAATAACAGAACAT^ 

TTATATAAATCAAGATATAATAAAAACAGAACTTGAAATTAGCAAATTAAAAAAAGAAA 

CAAAACATAATAACCGCAAAAGAAAAGCATAACACCAAAAC 

TTAACAATAAACAAAAAAAATTTGCAGAATATTCTAAC 

AGAGCAAACAAATATATCAAATTTAAATAAAGAATTT^ 

CTTAAAAAAATAGAAAATTATTATTTGCTAGAAATC 

TATTTAAAACATCATCTTCTGTTAATGAAATTGCAGATC 

AGAATTTTTAAAAATGAACATTAACGTCAAAAATAACTC^ 

AAAGGAATCTATCACGATAATATTTTTGACATTTCTAAAC^ 

CAAATTTCGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGA^ 

AACAATCTCTAAAAAAGTATCAATTAAAAGCAATGTACAATCT 

ACCCCAATTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTG^ 

AGTTAATAAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGA 

AGATAAATTTTATGTCAATCTGGCCGTCTTTAGAT^ 

AATTCAGAAGTACTTTATAAAATAACAGGCAATGACTTTATTAACAA 

CAAAAGCGGAACAAATGACTGCAACATTTCTATTT^ 

AATAACTCATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

f893.aa 

MVRFIjGFLYIjITTIPLIKSCDAAQFGDYKPLYFENENDLKTANEYINSLGYKTISE^ 

INKLNNLDLRKS I FLKKLSNLFNI EHKKLLYVENRFKS INFKNLKKELNINADIHSLDYKTKINF ISSIIFLIIII 

LLIFLDPTNSIFTLIFLLISSLAFTOISKEIMYFYPFTVLSYLLFL 

LFTFTALYFITITTFFTTNIDPTFIAFVAIPTLCIFLI^^ 

LLLFTLSLIPFAYSSYMLNSYENINYLYSKKLNYFD^ 

YGKIPKDIKENYFEIKNDKIEIHPKTVYEVDKSF 

FSLPFFVLLFLFKAIRFTILLNIN 

EKTYKKYIQG 

t893.aa 

CDAAQFGDYKPLYFENENDLKTANEYINSLGYKTI SEYTTKIDILDFPENKEITINE INKLNNLDLRKS I FLKKLS 
NLFNIEHKKLLYVENRFKSINFKNLKKEI^INADIHSLDYKTKINFISSIIFLIIIILLIF 
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SSIAPMISKEIMYFYPFTVLSYLLFLIISNFNKNYNKIYLKEIOT 
IDPTFIAFVAIPTIXTIFLIFSWIKTESNFKOT^ 

SYENINYLYSKKI^FDYLNPNNIYIMI^YNKDMPNI IGYLSHILYQNELKYNITAKYGKI PKDIKENYFEIKNDK 

IEIHPKTVYEVDKSFIDEILKKDLASLFLKNKNPI^ 

LLNINEKTYKKYI QG 

f893.nt 

ATGGTGCGTTTTTTAGGTTTTTTATATTT^ 

GAGACTACAAACCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGC 

GAAAACAATCTCAGAATAGACAACAAAAATTGACA^^ 

ATAAACAAACTTAACAATCTTGACCTGAGAAAAAGCATATT^ 

ACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTTO 

TGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTT^ 

TTATTAATTTTTTTAGACCCAACAAACTCTA 

TAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTC 

CAAAAATTACAATAAAATATATTTAAAAGAAATAAATTTTTTAACACT 

TTATTTAGATTCACAGCTCTATATTTCATT^ 

CATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTC 

TTTCTTATTCCCAATCGAGATTAAAGAGAAAAAAATAGAAGGAA 

CTACTACTATTTACACTCTCATTAATTCCTTTCGCTTATTC 

ACCTTTACAGTAAAAAATTAAATTACTTTGATTATTTAAAT 

AGACATGCCCAATATTATAGGGTACCTATCCCACATTCTTTATCAAAACGAACTAAAATA 

TATGGAAAAATTCCTAAAGATATAAAAGAAAATTACTTTGAAATCAAAA 

CTK5TTTACGAAGTAGACAAATCATTTATTGATG 

AAATCCAATCCTAATATATAAAGAAAACAAGAATAATATCAACACAGATAAAAAAAATTACAA^ 

TTCTCTTTGCCCTTCTTTGTATTACTATTCCTATTTAAAGCAATAA 

AAACCTATAAAAAATATATTCAAGGATAA 

t893.nt 

TGTGATGCAGCTCAATTTGGAGACTACAAACCTTTAT^^ 

ATATAAATTCACTAGGATAGAAAACAATCTCAGAATACACAAC^ 

AGAAATCACAATAAATGAGATAAACAAACTTAACAA^^ 

AATCTTTTCAACATAGAGCACAAAAAACTTCTT^ 

AAAAAGAACTCAATATTAATGCCGACATACATTCTCTTC^ 

ATTTCTAATCATAATAATTTTATTAATTTTTTTAGAC 

TCATCTCTTGCTTTTATGATAAGCAAAGAAATAATC 

TAATAATCAGTAATTTTAACAAAAATTACAATAAAATATATTTAAAAGAAATAAAr^ 

AATAAAACACTTACTATTTTTATTTACATTCACAGCTCTATAT^ 

ATTGATCCCACTTTTATTGCATTTGTCGCAATACCAACCC 

AAAGCAATTTTAAAGACACTTTCTTATTCCCAATCGAGATO 

ATCAAAAATAGCAATACATCTACTACTATTTACACTCTCATTAATTC 

TCTTATGAAAACATTAACTACOTTTACAGTAAAAAATTAAATTACTT^ 

TAATGCTGGGATACAACAAAGACATGCCCAATATTATAGGGTACCTATCC 

ATACAATATTACCGCTAAGTATGGAAAAATTCCTAAAGATATAAAAGAAAATTACTT^ 

ATAGAAATTCATCCTAAAACTGTTTACGAAGTAGACAAATCATTTATTGA 

GTCTGTTTTTAAAAAATAAAAATCCAATCCT^ 

TTACAAAATACTTTTCTTTTTCTCTTTGCC 

CTTTTAAACATAAATGAAAAAACCTATAAAAAATATATTCAAGGATAA 
£895. aa 

MIRALLTNDLFLSCLVSGISAQVIKYGI 

TNFIIALAFALITIRDSFGVRYMSGVQAEYLNALSEKL^ I IGIVSAYI VCY 

F 
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t895.aa 

AQVIKYGIQTVKTRKLKLTPVHLLKKIFLET 

RYMSGVQAEYLJNAIjSEKLKKEIK IDTTK I KWKGHKKKEVLTG III G I VS AYI VCYF 
f895.nt 

ATGATAAGGGCATTGCTTACCAATGATCTTTTTTTGTCTTC 
ATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAAC 

AGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACT^ 

ACA^TTTTATAATAGCTCTTGCATTTGCCCTTATTACAA 

TTCAAGCAGAATATTTAAATGCATTATCAGAAAAA 

GGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAAT 
TTTTAG 

t895.nt 

GOTCAAGTGATTAAATATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTC 
AAATTTTTCTAGAAACAGGAGGCATGCCAAGTAGTCAI^ 
AACTGAAGGAATAGATACAAATTTTATAATAGCTCTTGCATTTGCCCTTATO 
AGATATATGTCTGGAGTTCAAGCAGAATATTTAAATGCAT^ 

CAACAAAAATAAAAGTGGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAA 
GTATATTGTGTGCTATTTTTAG 

f605.aa 

MYIGAAGKSFSI IIDSAFLSNCFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKWCGSKYSPMRPTLI I 
SKLFVFLLLVRTGQFSLVSIRLIFRIFFHWFZ 

t605.aa 

CFLFIGSFSRSDSU«SLSNSRFEYPYDASCEFSLVNIVKWCGSCT 
LIFRIFFHWFZ 

f605.nt 

ATGTATATTGGTGCAGCAGGAAAATCTTTTTCAATTATO 

TAGGATCTTTTTCAAGATCTGATTCTCTC 

TCAATTTTCTCTTGTGAATATAGTAAAGTATGTGTC 

TCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAA 

TTTTTTTCCATTGGTTTTGA 

t605.nt 

TGTTTTCTTTTTATAGGATCTTTTTC 

ATGATGCAAGTTGTGAATTTTCTCTTGTGAATATAGTAAAG 

AACTCTTATTATTTCAAAATTGCCAGTATCT 

TTGATATTTAGAATTTTTTTCCATTGG 

f606.aa 

MKLQRSLFLIIFFLTFLCCTCNKERKEGVSFKISLGAEPSSIjD^ 
AKGWDISSIX3TVYTFNIJ*EKITWSra 

RAIDEKTLEITLESPKPYFIDMLVHQSFIPVPVHVTEKYGQNWTSPENMVT PNEKYVFEKNNKYYD 
SNEVELEEITFYTTNDSSTAYKMYENEELDAI FGSI PPDLIKNLKIjRSDYYSSAVNAI YFYAFNTHIKPLDNVKIR 
KALTLAIDRETLTYKVLDNGTTPTRRATPNFSSYOT 



WO 98/59071 



PCT/US98/12718 



182 

TABLE 1 . Nucleotide and Amino Acid Sequences 

KKICEFIQNQWKKNLNIDVELENEEWTTYLNTKANGNre 

NELIKKSDLEL*DPIKRQDILRQAEEI I IEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 
t606.aa 

CCNNKERKEGVSFKI SLGAEPS SLDPQLAEDNVASKMIOTMFRGIVTGDPNTGGNKPGLA^ SSDGTVYTFNL 
REKITWSIXTVAITAEGIRKSYLRILNKE 

YFIDMLVHQSF I PVPVHVTEKYGQNWTSPENMVTSGPFKLKERI PNEKYVFEKNNKYYDSNEVELEEITFYTTNDS 
STAYKMYENEELDAI FGSI PPDLIKNLKLRSDYYSSAVNAI YFYAFNTHI KPLDNVKIRKALTLAIDRETLTYKVL 
DNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLA^ 

DVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLS I FTQGYTQFS SHNYSNPEYNELIKKSDLELDPI KRQ 
DILRQAEEI I IEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

f606.nt 

ATGAAATTACAAAGGTCATTATTTTTAATAATATTTOT 

AAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACCCTC 

ATCAAAAATGATTGACACAATGTTTAGAGGGATTGT^ 

GCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTT^ 

GAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATT^ 

TGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATT^ 

AGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAi^ 

AATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAA 

AAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTC 

TCAAATGAAGTAGAATTAGAAGAGATTACATTTTACA.CAACAAATC 

CTCATCAGCTGTTAATGCCATATACTT^ 

AAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTC 

GAGGAACTCCCAACTTTAGTTCATATTCTTAT^ 

TCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTCCAATTTO 

AAAAAAATTTCTGAATTTATTCAAAACCAATGGAAAAAAA 

GGACAACATACTTAAACACTAAGXSCAAATGGAAATT^ 

TCCTTTGACATTTTTAAGCATATTCACACAAGG 

AACGAACITATAAAGAAATCCGACCTTGAGCTTGATCCAATAAA 

TAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAA 

GACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATC 

t606.nt 

TGTTGTAATAACAAGGAAAGAAAAGAAGGAGTATCATTTAAAATAAGC^ 

CTCAATTAGCAGAGGATAATGTCGCATCAAAAATGATTGACACAA 

TACAGGGGGAAATAAACXTGGGACTTGCAAAAGGGTGGGATATTTC 

AGAGAAAAAATCACTTGGAGTGACGGAGTTGCAATCACTGCAGAAG^ 

ATAAAGAAACTGGCTCAAAGTACXSTTGAAATGGTTAAATCGGTAATTAAAAAT^ 

AGTGACTGACTCTGAACTTGGAATTAGAGCGATTG^ 

TATTTTATTGATATGTTAGTACACCAATCATTTATTCCAGTACCAG 

GGACAAGCCCCXSAAAACATGGTGACAAGTGGTCCT^ 

TGAAAAAAATAAC^VAATACTACGACTCAAATGAAGTAGAATTAGAAGAGATO 

TCAACAGCGTATAAAATGTATGAAAATGAAGAGCTAGATGCAAT^^ 

ATCTAAAATTAAGAAGCGACTATTACTCATC^ 

ACCACTTGACAACGTTAAAATTAGAAAAGCCTTAACTCT^ 

GACAACGGGACTACCCCTACAAGAAGAGCAACT^CCAACTTTAGTTC 

TTAATCCTOAAATTGCAAAAACCCTTCTAGCTGAAGCTGGATATC 

AAAATACAATACAAACGAAGCAAATAAAAAAATTTGTGAATTTATTCAAAA 

GATGTGGAACTTGAAAACGAAGAATGGACAACATACTTAAACACT 

CAGGATGGATAGGCGATTATGCTGATCCTTTGACATTTTTAAG 
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TCATAATTACTCAAACCCAGAATACAACGAACTTATAAAGAAA 
GACATTTTAAGACAAGCAGAAGAGATAATTATTGA 
GTTACCTTTTCAGAAATGACAAATGGACAGGGTGGAACACCAATAT^ 
ATTAAAAAATAAATAA 

f679.aa 

MFNRSSCVLQNFLFLFLFLSLVSCFAKKEISGNNFIKAHSKEFt)LJW 

YLFKKIGFEEKFVEYMKKAIANGDSIASQFAGIKLIEYFNSAKEYFASELIG 

KKNDKALSLLNKLDKMKFSDYQENENIIJ^ 

GANFLNLVRFKYEVANGNFNGAINILNKNG^ 

NLREKNNIX5LLLLKEYLEGLDLNNEINRLDLI^ 

ES IQLEDYGNLYKLYSNAQKVI SNSVLSKIAFINARLI YHKLIKPNVSGEYKSLLHSAVNYDKWS YSSFMSRYLLD 
QNIDEFFTGGSDIKYEQSDYEIFLEGFLKFNLCNYTOGF 

YLWQDESALMENDYKRLYPYLYGSLIEYWAKRRGLEASVVFSLIKAESSFEKNAVS 

KELKYFNYDLKI PKDNI I IGTYYLKKRI STTGSLYKAIASYNGGIGNVRKWEKSYGHLSKELF I EAI PFSQTRNYI 
KKI LVYSVFYDAL YEKKG I DSVIVKIMGEFPKNZ 

t679.aa 

CFAKKEISGNNFIKAHSKEFDLNNLNWLWNFDYTKKNFDKHFNID 

DSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFII 

NENILLKAVLYLNLSNVSESKIYFNELFENLPAN 

NILNKNGLNDYYDNNIVLSDVYKAFISSGKVSNALTFFSKIKSKYKN^ 

NEINRIJ)LLNTAFSNLIFTKSARDYFAESLPKFYTEGDKKNSTFIKI LEEYI LES IQLEDYGNLYKLYSNAQKVTS 
NSVLSKLAFINARLIYHKLIKPWSGEYKSLLHSA^ 
LEGFLKFNLO^YVRGFISEDFRNGYKFSLDFYRKVYDELLKSENYTO 
GSLIEYWAKRRGLEASVVFSLIKAESSFEKNAVSKPGAVGLMQVMPSTA 

LKKRI STTGSLYKALAS YNGGIGNVRKWEKS YGHLSKELFI EAI PFSQTRNYIKKI LVYSVFYDAL YEKKGI DSVT 
VKIMGEFPKNZ 

f679.nt 

ATGTTTAATAGAAGTTCTTGTGTATTAC 

CAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATT^ 

ATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCAOT 

TATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGA 

TTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATAT^ 

TGGAGAGAAGCTTTATAAAAAATACX3AAAATAATAAATTTATTATACTGG 

AAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAAT^ 

ATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTC 

TGAGAACTTACCTGCAAATTATTTACATGTAAGAGC 

GGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATG^ 

TAAATAAAAATGGTTTAAATGATTATTATGAG^TAACATTC 

TGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAA 

AACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAAT^ 

TTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTO 

AAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACT^ 

GAATCAATTCAGCTTGAAGACTATGGCAATCTT^ 

TTTTGTCTAAGCTTGCTTTTATTAATGCAAGGC 

GAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTC 

CAAAATATTGATGAATTTTTTACAGGTGGGTCTC 

GGTTTTTAAAATTCAATCTTTGTAATTATGT^ 

ACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTG 

TATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATC^ 

TGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTC 
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TGAAAAAAATGCTGTCTCAAAACCX3GCT 

AAAGAACTTAAGTATTTTAACTATGATTTAAAGAT^ 

AAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATG 

GGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTAT^ 

AAAAAAATATTAGTTTATTCGGTATTTTATGATG 

TTATGGGCGAATTCCCCAAAAATTAA 

t679.nt 

TGCTTTGCAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGC 
ATTGGTTATGGAATTTTGATTATACAAAAAAAAATTTTGAT^ 
TGTTGOTTATTTATTTAAAAAAATAGGATTTGAAGAGAAAT^ 
GATAGCATTGCATCCCAGTTTGCTGGGATTA 

AATTGATTGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATA 

TTGGCAAAAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTO 

AATGAAAATATTTTATTAAAAGCAGTTCTTTAC 

AGCTTTTTGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTT 

GTATTTTGGTGCAAATTTTTTAAATC 

AATATATTAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTC 

TTAGTTCTGGCAAAGTTTCAAATGCTTTAACATTTTTO 

TATTCTAAACCTTAGAGAGAAAAATAATTTAGGACTTC 

AATGAGATTAACAGGCTTGATTTGCTTAATACTG 

TTGCCGAAAGTTTACCCAAGTTTTATACCX5AGGG 

TATTTTGGAATCAATTCAGCTTGAAGACTATGGCAATCTTT 

AATTCrGTTTTGTCTAAGCTTGCTTTTATTAAT^ 

GAGAATACAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGG 

ATTAGATCAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTA^ 

TTGGAAGGGTTTTTAAAATTCAATCTTTGTAATTATGOT 

AATTTTCACTTGATTTTTATCGAAAAGTAT^^ 

GATTAATTATCTTGTAAATCAAGATGAATCTGC 

GGATCTTTGATAGAATATTGGGCTAAAAGGAGAGGGCT^ 

GTAGCTTTGAAAAAAATGCTGTCTCAAAACCGGGTGCTG 

TATTTCTAAAGAACTTAAGTATTTTAACTATGATTTAAAGATTC 

TTAAAAAAAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCITGCGTC 

GAAAGTGGGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTT^ 

TTATATTAAAAAAATATTAGTTTATTCGGTATTTTATGATGCT^^ 

GTTAAAATTATGGGCGAATTCCCCAAAAATTAA 

f 11-12. nt 

TAAAAGGAGA ATATTTTTAT GAGAAAAAGT TTGTTTTTAT ATGCATTATT AATGGGAGGA 
TTGATGTCTT GTAATCTAGA TTCCAAATTA TCTAGTAACA AAGAACAAAA AAATAACAAT 
AATGTAAAAG AAGTTTCGGA TAGTGTTCAA GAAGATGGTC TTAATGATTT ATATAATAAT 
CAAGAAAAGC AAAAAAGCTT TACTAAAAAT TTTGGAGAAC GGAAATATGA GGATTTAATT 
AATCCTATAG AGCCTATAAT ACCTTCAGAA TCACCAAAGA ATAAGGCTAA TATACCAAAT 
ATTTCAATTG CGCATACTGA AAAAAAAGAG ACAAAAAAGG AGAATTTAAT CCCTTCTACT 
AATGAAGAAA AGGAAGCTGA TGCAGCAATT AAATATTTAG AAGAAAATAT TCTTAAAAAC 
TCTAAATTTT CTGAATTAAT TAGAGAAGTA CGTGTAATTA AAGATGAATA TGCTTTAATA 
AAAGCTGATT TGTATGATGT AATTGGAAAG ATTAACAATA AAAAAACATC ATTAATGGAG 
AATCCTAAGA ACAATAGAGA TAAGATAAAT AAATTAACAC AATTGTTGCA AAATAATTTA 
AAGATAGATA GTGAACTTGA GCAGCTTATA AATATGATTG ATATGGCAGA AAATGAAATA 
AGCTCTGCGG CTTTCTTTTT TGACAACGCT CAGAAAAGGT TAAAAGAAAG CATTATTAAA 
AGATTAGAGA GTAAAAATAA TAGATCTTAT GCATTAAAAT TGTCTAGACA GGCTTTAAGT 
GACGCAAGAA GTGCTTTAAG TAATTTAGAA TCTTTTGCCT CTAAAAGAAT TGAACCAATG 
GTGAGAAAGG AAGAAATAAA AGAGCTTATT AAACATGCAA AAACTGTTTT AGAAAGTCTC 
AATAAAAAAT AA 
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tll-12.nt 



TTGTAATCTAGATTCCAAATTATCT 

GTTCAAGAAGATGGTCTTAATGATTTATATAATAATC^ 

GGAAATATGAGGATTTAATTAATCCTATAGAGCCTATAA^^^ 

AAATATTTCAATTGCGCATACTGAAAAAAAAGAGACAAAAAAGGA 

GAAGCTGATGCAGCAATTAAATATTTAGAAGAAAATATTCTTAAAAACTCT 

TACGTGTAATTAAAGATGAATATGCTTTAATAAAAGCTGATTTGTATGATC 

AACATCATTAATGGAGAATCCTAAGAACAATAGAGATAAGATAAATAAATTAACACAATTG 

AAGATAGATAGTGAACTTGAGCAGCTTATAAATATGATTGA 

TTTTTGACAACGCTCAGAAAAGGTTAAAAGAAAGCATTATTAAAA 

ATTAAAATTGTCTAGACAGGCTTTAAGTGACGCAAGAAGTGCT^ 

ATTGAACCAATGGTGAGAAAGGAAGAAATAAAAGAGCTTATTAAACATGC 

AAAAA 



f 11-12. aa 



KENIFMRKSL FLYALLMGGL MSCNLDSKLS 
EKQKSFTKNF GERKYEDLIN PIEPIIPSES 
EEKEADAAIK YLEENILKNS KFSELIREVR 
PKNNRDKINK LTQLLQNNLK IDSELEQLIN 
LESKNNRSYA LKLSRQALSD ARSALSNLES 
KK 



SNKEQKNNNN VKEVSDSVQE DGLNDLYNNQ 
PKNKANIPNI SIAHTEKKET KKENLIPSTN 
VTKDEYALIK ADLYDVIGKI NNKKTSLMEN 
MIDMAENEIS SAAFFFDNAQ KRLKESIIKR 
FASKRIEPMV RKEEIKELIK HAKTVLESLN 



tll-12.aa 

CNIJ)SKLSSl^QKNNNNVKEVSDSVQa 
NISIAHTEKKETKKENLIPSTOEEKEADAAIKY^ 

TSLMENPKNNRDKINKLTQLLQNNLKIDSELEQLINMIDMAENEISSAAF 
LKLSRQALSDARSALSl^ESFASKRIEPMVRKEEIKELIKHAKTVLE 



fll-4.nt 



TAAAGGAGTT TACAAATGAG TAAACTAATA 
TGTAAATGGT ATGTAGACAA TACCATTGAT 
ACATCTATTG ATCAAGTATT AGATGAGATA 
ATCACAAAAT TAACTCCGGA AGAGCTAGAA 
GAAAAATCCA AAAAAGAAAT TGAAGATCAA 
GTAAAGGATA CTCCTCGCTT AATCAAATTG 
GTTTTTCAAA CACTAATTAA TATAGGTTAT 
AAGAATGGAC TAAAGATGGT GAAATTACTG 
AATGGTGATA AAAGTACCCA AAAATACAAT 
GCTGAAAATT CGGTAAGCGT TTCTTTTAAA 
AAATGTATTC AAACTCTTAT GAAAAATGTA 
CTTAAAAACA AAAATGATGG TGAGTACGAA 

tll-4.nt 



TTGGCAATAT CTATACTGCT AATAATTTCA 
GAAGCAACTG TAGAAAGTAA ATCAGCACTA 
AGTGAAGCCA CAGGCCTAAG TTCGGAAAAA 
AATTTAGCAA AGGAAGCTCA AGATGACTCT 
AAAAATACCA AGGAAAGTAA AAACATAGAA 
ATAAAGAATT CATCAGAAAA AATTGATTCG 
AATGCTACCT ATGCAGCCAA AAGTAATTTG 
GATGAGTTGC TAAAAATATC GGTAAGTAGC 
GAACTTAAAA CCGTTGTAAA TAAGTTTAAT 
GAACATTCAA ACAGTAAAAT TGAAACTAAA 
GAAACATACT TTGAAGGTGT ATGCAGCGAA 
AAAACATTGA CAACTTTAAG CTAA 



ATGTAAATGGTATGTAGACAATACCATTGATGAAGCAACTGTAGAAAGTA 
GTATTAGATGAGATAAGTGAAGCCACAGGCCTTAAGTTC 
ATTTAGCAAAGGAAGCTCAAGATGACTCTGAAAAATCC^ 
TAAAAACATAGAAGTAAAGGATACTCCTCGCTTA^ 

TTTCAAACACTAATTAATATAGGTTATAATGCTACCTATGCAGCCAAAAGT 
TGAAATTACTGGATGAGTTGCTAAAAATATCGGTAAGTAGC AATGGTGATA 
TAAAACCGTTGTAAATAAGTTTAATGCTGAAAATTCGGTAAGCGTTTCTT^ 
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GAAACTAAAAAATGTATTCAAA^ 
ACAAAAATGATGGTGAGTACGAAAAA 

fll-4.aa 

RSLQMSKLIL AISILLIISC KWYVDNTIDE ATVESKSALT SIDQVLDEIS EATGLSSEKI 
TKLTPEELEN LAKEAQDDSE KSKKEIBDQK NTKESKNIEV KDTPRLIKLI KNSSEKIDSV 
FQTLINIGYN ATYAAKSNLK NGLKMVKLLD ELLKISVSSN GDKSTQKYNE LKTWNKFNA 
ENSVSVSFKE HSNSKIETKK CIQTLMKNVE TYFEGVCSEL KNKNDGEYEK TLTTLS 

tll-4.aa 

CKVmTONTIDEATVESKSALTSIDQVLDEISEATC 

KNIEVKDTPRLIKLIKNSSEKIDSWQTLINIGYNATYA^ 

KTVVNKFNAENSVSVSFKEHSNSKIETKKCIQTLMKNVETYFEGVCSELK^ 

f 112-1. nt 

TGAATCTCTA AAGATTTTAG CAGGGGAGAA AATATGAAAA AAAGTTTTTT ATCAATATAC 
ATGTTAATTT CAATAAGTTT ATTATCATGT GATGTTAGTA GATTAAATCA GAGAAATATT 
AATGAGCTTA AAATTTTTGT TGAAAAGGCC AAGTATTATT CTATAAAATT AGACGCTATT 
TATAACGAAT GTACAGGAGC ATATAATGAT ATTATGACTT ATTCGGAAGG TACATTTTCT 
GATCAAAGTA AGGTTAATCA AGCTATATCT ATATTTAAAA AAGACAATAA AATTGTTAAT 
AAGTTTAAGG AGCTTGAAAA GATTATAGAA GAATACAAAC CTATGTTTTT AAGTAAATTA 
ATTGATGATT TTGCGGGATC CGTT 

tll2-l.nt 

ATGTGATGTTAGTAGATTAAATCAGAGAAATATTAATC^ 
ATAAAATTAGACGCTATTTATAACGAATGTACAGG 

CTGATCAAAGTAAGGTTAATCAAGCTATATCTATATTTAAAAAAGACAATAA 
TGAAAAGATTATAGAAGAATACAAACCTATGTTTTTAAGTAA^ 

f 112-1. aa 

ISKDFSRGEN MKKSFLSIYM LISISLLSCD VSRLNQRNIN ELKIFVEKAK YYSIKLDAIY 
NECTGAYNDI MTYSEGTFSD QSKVNQAISI FKKDNKIVNK FKELEKIIEE YKPMFLSKLI 
DDFAGSV 

tll2-l.aa 

CDVSRLNQRNINELKI FVEKAKYYSI KLDAI YNECTGAYNDIMTYS EGTF SDQS KVNQAI S I FKKDNKI VNKFKEL 
EKIIEEYKPMFLSKLIDDF 

fl4-8.nt 

TAAATACAGA GCCATTCAAG GAGAGTATTT ATGAAATACT ATATATGTGT GTGTGTTTTT 
TTGCTTTTGA ATGCTTGCAA TTCAGATTTT AGCACTAATC AAGAAGATAT TAAATATCCA 
TCTGATAAAG AGAAATCAAA ATCCAACATG GAAGCAAGCT CTAAAGAAGA AGATCCAAAT 
AAAAAAATAA AAAATACACT GCTTAATGAT TTAATAAATT TGATAGAAAT AGCTAATGAG 
CATAAAGAAA AATATGAAAA AAGAATGCAA GAAGAACCTT CAGATCAATA CGGAATATTG 
GCTTTCCAGG AATTAGACTT GTCCGTTGGA AAAATATCTG AAGACACCCC GCAATCTAAA 
AAATTTAGAA AAAACACCTA TTCTCCCTTA AGCGCTATTG ATGTCAATAA ATTAAAAGAT 
CTTTCAGAGA TTATAAGAAA TTCGGGCCAA ATACAAGGTT TATTTAATAT TTTCAACAGA 
TTCGGAGGCA TTTTTGACGA CTCACTTAAT CACGTATATT CTAAAAAAGA TATCCTAGGG 
GGACTAGAAA TTTTGGATTT AGATAAACTA AAAAATTCGT TTGAAAAATT ACTATCTATA 
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AAAGAAACTT TCTCAAAAAT GCTAAATCAA 
CATATACGAA CAGAGACAAA TAAACTTAAA 
GATAAAAAAG AAGACGAAGC ATATGAACCT 
TAA 



CTTTTATTAG ATTATAAAAA TGATAAAGAT 
TCTCATACAA CTGCACTTTT CGAACAACTT 
AAAAATCAGA TATTTTCAAT AAGTAACCTT 



tl4-8.nt 



TTGCAATTCAGATTTTAGCACTAATCAAGAAGATATTAAATATCCATCTGATA 

GAAGGAAGCTCTAAAGAAGAAGATCCAAATAAAAAAATAAAAAATACACTGCTTAATGA 

AAATAGCTAATGAGCATAAAGAAAAATATGAAAAAAGAATGCAAGAAGAACCTTC 

TTTO^GGAATTAGACTTGTCTOTTGGAAAAATATCTGAAC^ 

TATTCTCCCTTAAGCGCTATTGATGTCAATAAATTAAAAGATCTTTC 

AAGGTTTATTTAATATTTTCAACAGATTCGGAGGCATTT^ 

TATCCTAGGGGGACTAGAAATTTTGGATTTAGATAAACTAAAAAATTC 

ACTTTCTCAAAAATGCTAAATCAACTTTTATTAGATTATAAAAA 

AACTTAT^TCTCATACAACTGCACTTTTCGAACAACTTGATAA 

G 



fl4-8.aa 

IQSHSRRVFM KYYICVCVFL LLNACNSDFS 
KIKNTLLNDL INLIEIANEH KEKYEKRMQE 
FRKNTYSPLS AIDVNKLKDL SEIIRNSGQI 
LEILDLDKLK NSFEKLLSIK ETFSKMLNQL 
KKEDEAYEPK NQIFSISNL 



TNQEDIKYPS DKEKSKSNME ASSKEEDPNK 
EPSDQYGILA FQELDLSVGK ISEDTPQSKK 
QGLFNIFNRF GGIFDDSLNH VYSKKDILGG 
LLDYKNDKDH IRTETNKLKS HTTALFEQLD 



t!4-8.aa 



CNSDFSTNQEDIKYPSDKEKSKSNMEASSKEEDPNK^^ 
FQEI^LSVGKISEDTPQSKKFRKNTYSPLSAIDVNKL^ 

I LGGLEI LDLDKLKNSFEKLLS I KETFSKMLNQLLLDYKNDKDHIRTETNKLKSHTTALFEQLDKKEDEAYEPKNQ 



f!7-6.nt 



TAAAGGAGGG TATTTATGAA ATACCACATA 
GCTTGCAGGC CGGATTTTAA TATCGATCAA 
TCAAGGCCCA AAACTGAAAG CTCTAAGCAA 
CTTAAGAAAA AACAACAAGA AGAAGAGCTT 
AAAAAACAAC AAGAAGAAGA GCTTAAGAAA 
AGAAAACAAC AACTAAAAAA TACGCTATCT 
TACAATTTTA AAGAAAAATA TGTAAAAAGT 
ATGACGTCTT TTAGGGGATT GAATTGGGGG 
GAAAGATCTA TAAGATATAG AAGACACACT 
GAATTAAAGG AATTCGCAAA TATTATTCAA 
ATATTTAATT CTTTTAGCGC TATTGGAGGA 
TTCAAAAAAG ACAATCTAGA CAAACTAGAT 
TTTGAACAAA TATTATATAT AAAAGGAAGT 
GATTATAAAA ATCTAAAAAC AGATATTAAT 
AATGGAATTA AGCAACAAGC TCTAGAAGCA 
TATAAACTTT AA 



ATTACAACTA TATTTGTTTT TCTGTTTTTA 
AAAGACATTA AATACCCGCC TACTGAAAAA 
AAAGAATCAA AGCCTAAAAC AGAAGAAGAG 
AAGAAAAAAC AACAAGAAGA AGAGCTTAAG 
AAACAACAAG AAGAAGAGAA GGAAGAACTA 
AATGATTTAA AAAAGCAAAT AGAATCGGCC 
ATGGAAAAAG AACCTGAAGA CCATTACGGG 
CCAGGGACTG AAGATATATC TGACAATACC 
TATACTGTTT TAAGCCCCCT GGATCCTCAT 
GATATAAATA AACTAGCATC AGTAGCAAGT 
GCTCTTGACA TAGTAAGTGA TCACCTATAT 
ATTGCAGATT TAGAAATACT TAAAAATTCA 
GTTGCAGGAA AAGCAAAAAA ACTTTTATTA 
AAGCTTAAAT CTTATTCAAA TGAACTGGTT 
GAAAATCTAG AAGAGCTTAT AGTGTCAAAA 



tl7-6.nt 



TTGCAGGCCGGATTTTAATATCGATCAAAAAGACATTAAATACCCGCCTACTC 
AGCTCTAAGCAAAAAGAATCAAAGCCTAAAACAGAAGAAGAGCTTAAGAAAAAACAACAAGAAG 
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AAAAAGAACAAGAAGAAGAGCTTAAGAAAAAACAACAAGAAGAAGAC^ 

GGAAGAACTAAGAAAAGAACAACTAAAAAATACGCTAT^^ 

TTTAAAGAAAAATATGTAAAAAGTATGGAAAAAGAACCT^ 

ATTGGGGGCCAGGGACTGAAGATATATCTGACAATACCG 

TTTAAGCCCCCTGGATCCTCATGAATTAAAGGAATTCGCAAATATC 

GCAAGTATATTTAATTCTTTTAGCGCTATTGGAGG 

ACAATCTAGACAAACTAGATATTGCAGATTTAGAAAT^ 

AAGTGTTGCAGGAAAAGCAAAAAAACTTTTATTAGATTATAAAAATCTAAAA^ 

TATTCAAATGAACTGGTTAATGGAATTAAGCAAC 

AATATAAACTT 

fl7-6.aa 

RRVFMKYHII TTIFVFLFLA CRPDFNIDQK DIKYPPTEKS RPKTESSKQK ESKPKTEEEL 
KKKQQEEELK KKQQEEELKK KQQEEELKKK QQEEEKEELR KQQLKNTLSN DLKKQIESAY 
NFKEKYVKSM EKEPEDHYGM TSFRGLNWGP GTEDISDNTE RSIRYRRHTY TVLSPLDPHE 
LKEFANIIQD INKLASVASI FNSFSAIGGA LDIVSDHLYF KKDNLDKLDI ADLEILKNSF 
EQILYIKGSV AGKAKKLLLD YKNLKTDINK LKSYSNELVN GIKQQALEAE NLEELIVSKY 
KL 

tl7-6.aa 

CRPDraiDQKDIKYPPTEKSRPKTESSKQKESKPKTEEELKKKQQEEELKKKQQEEEL^ 
EELRKQQLKNTLSNDLJOCQIESAYNFKEKYVKSMEKEPEDHYGMTSFRG 

LSPLDPHELKEFANI IQDINKIiASVASI FNSFSAIGGALJDIVSDHLYFKKDNIjDKLD I ADLEI LKNSFEQILYIKG 
SVAGKAKKLLLDYKNLKTDI^^a J KSYSNELWGIKQQALEAENLEELIVSKYKL 

fl9-2.nt 

TAAAGAAAGA TTAAATCATA TTCAAGGAGA GTATTTATGA AACACTATAT AATTGTGCAT 
ATATTTGTTT TTCTATTTTT AAATGCTTGT TATCCAGTTG CATCTAATAA AATAGAATTA 
AAACCTAAAA CAGAAACAAG CTTAAATCAA GAAGAAGTCC CAAATCAAGA AGCAAACTAC 
AAAGAAGAAA AAGAAGGAAA AGAAGAAGGC ATTAATAAAA AAACAGAAAA CACGCTGCTT 
AATGATTTAA GAAATTTAAT AGAAACAGCT AAAAAAGATA ATGATAAATA TACACAAAAG 
TTAAAAGAAG AATCCTCAAG CCAATACGGA ATACTGGCTT TCAAAGATTT GTTCTGGCTA 
GATGGAACAA ATGAACAATT GTCCGCAAAT ACCGAAAGAT CTAAAGCCTA TAGAAAACGA 
GCTTATAGCA TCTTAAATAC TATTAATGAC GCTTCCTTAA AGAATTTTTC AGAAATTGTA 
ATGGCATCAG GACAAACACA GGGCATATTT AATACCCTTA ACTCACTTGG GGGTAATTTT 
GAAAAGATAG TTAATTGTTT GTATCCCAAA AAAGACAATT TGGAAAAATT AGAGACTTCA 
GTTTTAAAAA AGCTTAAAGA TTCTTTGGAA AATTTTTTAG AGATAAAAAA AATCGCCTCA 
GAAATGATGC ACAAGCTCTT ATTAGACTAT CAAAATAATA CAAATCGTAT ACAAACAGAT 
AAAAATGAAC TTAAGTCTTA TGCAGACACA CTTTTCAATC AAATGACAAA AAAACCCGAA 
GAAGCACTAA AGCTAAAAAA TACCATATGC TCAATAGAGG ACCTTTAA 

tl9-2.nt 

TTGTTATCCAGTTGCATCTAATAAAATAGAATTAA 

AATCAAGAAGCAAACTACAAAGAAGAAAAAGAAGCAAAAGAAGAAGGCAT^ 

TTAATGATTTAAGAAATTTAATAGAAACAGCTAAA 

CTCAAGCCAATACGGAATACTGGCTTTCAAAGATTTGT^ 

ACCGAAAGATCTAAAGCCTATAGAAAACGAGCTO 

TTTCAGAAATTGTAATGGCATCAGGACAAACACAG 

AAAGATAGTTAATTGTTTGTATCCCAAAAAAGAC^ 

GATTCTTTGGAA7VATTTTTTAGAGATAAAAA 

ATAATACAAATCGTATACAAACAGATAAAAATGAACJTTAAGTCTT 

AAAACCCGAAGAAGCACTAAAG 
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fl9-2.aa 



RKIKSYSRRV FMKHYIIVHI FVFLFLNACY 
EEKEAKEEGI NKKTENTLLN DLRNLIETAK 
GTNEQLSANT ERSKAYRKRA YSILNTINDA 
KIVNCLYPKK DNLEKLETSV LKKLKDSLEN 
NELKSYADTL FNQMTKKPEE ALKLKNTICS 



PVASNKIELK PKTETSLNQE EVPNQEANYK 
KDNDKYTQKL KEESSSQYGI LAFKDLFWLD 
SLKNFSEIVM ASGQTQGIFN TLNSLGGNFE 
FLEIKKIASE MMHKLLLDYQ NNTNRIQTDK 
IEDL 



tl9-2.aa 

CYPVASNKIELKPKTETSLNQEEVPNQEANYKEEKEAKETC 
SSQYGIIAFKDLFWLIXSTNEQLSANTE^ 
KIVNCLYPKKDNLEKLETSVLKKIjKDSLENFL^ 
KPEEALK 

f!9-4.nt 

TAATCTATAC TAATTGAGGA GAATATTTTT ATGAAAAACA ACATAATTTT ATGCATGTGT 
GTTTTTTTAC TTTTAAATAG CTGCACCGCT AACCATGAAG CTGAAGCGAA AATAAAAAAA 
CATGTTGATA AAACAAAAAA CGAATATATT AATGAAATAA AAAATTTAAT AGCAACAACC 
AAAGAAATCA TCGAAAAACG AAAATTGCTA CAAGCTAAAC CAGTAGATCA AAACCCCGTA 
GATGATACAA ACAATAAGAA AGTTTTCGAG ATAGATAAAA GAGCTTTCGA TTTTATAAAT 
AGTTTTTTAA CAGATGATGA ATTTAATAAA TTTGTAACAA TATTTCATAA ACCAACACTA 
AAATCACCCG GAAAAGTATT AAATAGCATA GCAATTCTAG AGCTAAACAT AGAGCAGGTA 
ATTAATCACC TAGACTCAAA AAATGAGACC TTAAATAAAG CAAGCTCTTT AGATTTGGAA 
AAGATCAAAA ATTCCCTTGA ACAGCTGTTC TCTATAAGGA ATTTTTTTTC AACAATCATA 
AAAAGGGTCT TATTAGATCA TCAAAACAAT GAAAATTCTA TAAAACCAGA TGATTCTAAA 
TCAGGAACCT ATTTCGATAC GATATACGAT CAGTTTAATG AAAAAAATAA AGAGGTTAGA 
AATCTGAAAA AAACCATATT ATCACTGCCG AATTAA 



tl9-4.nt 



CTGCACCGCTAACCATGAAGCTGAAGCGAAAATAAAAAAAC^ 
ATAAAAAATTTAATAGCAACAACCAAAGAAATCATCGAAAAACGAAAATTGCT 
ACCCCGTAGATGATACAAACAATAAGAAAGTT^ 
AACAGATGATGAATTTAATAAATTTGTAAGA^ 

AGCATAGCAATTCTAGAGCTAAACATAGAGCAGGTAATTAATCACCTAGACT^ 
CAAGCTCTTTAGATTTGGAAAAGATCAAAAATTCCC 
CATAAAAAGGGTCTTATTAGATCATCAAAACAATGAAAATTCTATAAAA 
TTCGATACGATATACGATCAGTTTAATGAAAAAAATA 



fl9-4.aa 



SILIEENIFM KNNIILCMCV FLLLNSCTAN 
EIIEKRKLLQ AKPVDQNPVD DTNNKKVFEI 
SPGKVLNSIA ILELNIEQVI NHLDSKNETL 
RVLLDHQNNE NSIKPDDSKS GTYFDTIYDQ 



HEAEAKIKKH VDKTKNEYIN EIKNLIATTK 
DKRAFDFINS FLTDDEFNKF VTIFHKPTLK 
NKASSLDLEK IKNSLEQLFS IRNFFSTIIK 
FNEKNKEVRN LKKTILSLPN 



tl9-4.aa 



CTANHEAEAKIKKHVDKTKNEYINEIKNLIATTKE 
TDDEFNKFVTIFHKPTLKSPGKVLNSIAILE^ 
IKRVLLDHQNNENS IKPDDSKSGTYFDTI YDQFNEKNKEVRNLKK 



fl9-6.nt 
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TAAAGGAGAG TATTAATGAA ATGCCATATA ATTGCAACTA TATTTGTTTT TCTATTTTTA 
GCTTGCAGTA CAGATTTTAA TACTGATCAA AAAGGCATTA AATACCCGCC TACCGAAAAA 
TCAAAGCCCA AAACTGAAGA CTCTAAGCAA AAAGAATTAA AGCCTAAAAC AGAAAAAGAA 
CTAAAGAAAA AACAACAACT AAAAAATAAA CTACTTAATG ATTTAAAAAA TTCAATAGAA 
ACAGCTAATA AGCATAAAGA AAAGTATAAA AAAAGAATGA AAGAAGAACC CGAAGATCAA 
TACGGGGTAC AGGCTTTCAA AGGATCGAAT TGGGGGCCGG GGACTGAAGA TGTATCTGCC 
AACACCGAAA GATCTATAAG ATTTAGAAGA CATACTTATA CTATTTTAAG CACGCTGAGT 
CTTCATGAAT TAAAGGAATT CTCAAATATT GTTACAAATG AAAATAAACT GGTGCCAGTA 
GTAGATATGT TTAATTTCTT TAGCTCTATT GGGACAGCTC TTGATATAAC AACCGATAGC 
TTATATCCCA AAAAGACAAT CTGGACAAAC CAGATCTGTC GGATTTAG 

tl9-6.nt 

TTGCAGTACAGATTTTAATACTGATCAAAAAGGCATTAAATACCCGCCTACCGAAAAATCAAA 
GACTCTAAGCAAAAAGAATTAAAGCCTAAAACAGAAAAAGAAC 
TTAATGATTTAAAAAATTCAATAGAAACAGCTAA 
CGAAGATCAATACGGGGTACAGGCTTTCAAAGGATCGAATT^ 

GAAAGATCTATAAGATTTAGAAGACATACTTATACTATTTTAAGCACGCTGAGTCTTCATO 

CAAATATTGTTACAAATGAAAATAAACTGGTGCCAGTAGTAGATATGTTTAA 

TCTTGATATAACAACCGATAGCTTATATCCGAAAAAG^ 

fl9-6.aa 

RRVLMKCHII ATIFVFLFLA CSTDFNTDQK GIKYPPTEKS KPKTEDSKQK ELKPKTEKEL 
KKKQQLKNKL LNDLKNSIET ANKHKEKYKK RMKEEPEDQY GVQAFKGSNW GPGTEDVSAN 
TERSIRFRRH TYTILSTLSL HELKEFSNIV TNENKLVPW DMFNFFSSIG TALDITTDSL 
YPKKTIWTNQ ICRI 

tl9-6.aa 

CSTDFNTDQKG I KYPPTEKSKPKTEDSKQKELKPKTEKELKKKQQLKNKLLNDLKNS I ETANKHKEKYKKRMKEEP 
EDQYGVQAFKGSNWGPGTEDVSANTERSIRFRiOT 
LDITTDSLYPKKTIWTNQICR 

f21-4.nt 

TAGGAGACAA TCTTTATGAA 
CTGATAAGTT CTTGTAAGAA 
TTAGAAAGTT CAGAACAAAA 
GGATTTTTAG AAATTTTAGA 
AAACAAATTC AAGAATTAAA 
GAAACATATT CTGGGTATGA 
GGACTTGAAG ATAAATTAAA 
AAAAAAGCTT TACAAGAGGC 
GCAACTGGAG TAACGCATGG 
TGGCAGTGTG CTAATAGTTT 
ATGACCAATG AAGTTATAAC 
GGAGAAACTG TAGAAGGTAA 

t21-4.nt 

TTGTAAGAATGATGTAACTAGTAAAGATTTAGAAGGGGC^ 
AAAAC7VGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTA 
ATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAATAAGATA 
TGAAACATATTCTGGGTATGAAGAAAAAATAAACAAAATAAAAG 



TAAAAAAATA AAAATGTTTA 
TGATGTAACT AGTAAAGATT 
TGTAAAAAAA ACAGAACAAG 
GACAAAAGAT TTAAACACAT 
GAATAAGATA GAAAAATTAG 
AGAAAAAATA AACAAAATAA 
TGAACTTTCA GAGAGCTTAA 
TAAAAAGAAA TTTGAAGAGT 
TTCTCAAGTC CAAAGACAAG 
GGGGTTTAAA AATATGACTA 
TAATTCGCTT AAAAAGATTG 
AAAAGAATAA 



TTATTTGTGC TATTTTTATG 
TAGAAGGGGC GGTGAAAGAT 
AGATAAAAAA ACAAGTTGAA 
TAGATACAAA AGAAATTGAA 
ACTCTAAAAA AACTTCTATT 
AAGAAAAATT AAACGGAAAA 
AAAAGAAAAA AGAGGAGAGA 
ATAAAAACCA AGCTGAATCT 
GTGGTGTTGG ATTACAAGCT 
GTGGTAATAA TACTAGCGAT 
AAGAAGAACT TAAAAATATT 
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TTAAATGAACTTTCAGAGAGGTTAAAAAAGAAAAAAGAGGAGAGAAAAAAA 

TTGAAGAGTATAAAAACCAAGCTGAATCTGC^CTGGAGTAACGCATGGT^ 

TGGATTACAAGCTTGGCAGTGTGCTAATAGTTTGGGGTT^ 

ACC^TGAAGTTATAACTAATTCGCTTAAAAAGATTGAAGAAGA^ 

AAAAAGAA 

f21-4.aa 

ETIFMNKKIK MFIICAIFML ISSCKNDVTS KDLEGAVKDL ESSEQNVKKT EQEIKKQVEG 
FLEILETKDL NTLDTKEIEK QIQELKNKIE KLDSKKTSIE TYSGYEEKIN KIKEKLNGKG 
LEDKLNELSE SLKKKKEERK KALQEAKKKF EEYKNQAESA TGVTHGSQVQ RQGGVGLQAW 
QCANSLGFKN MTSGNNTSDM TNEVTTNSLK KIEEELKNIG ETVEGKKE 

t21-4.aa 

CKNDWSKDLEGAVKDLESSEQNVKKTEQEIKKQVTC 
ETYSGYEEKINKIKEKLNGKGLEDKLNEIiSESLKKKKEERKKALQEA 
GLQAWQCANSLGFKNMTSGNNTSDMTNEVITNSLKKI EEELKNIGETVEGKKE 

f24-l.nt 

TAAGCTGGTA ACACTGTAAA GACAGCTGAG GGGGCTTCAA GTGGTACTGA TGCAATTGGA 
GAAGTTGTGG ATAATGATGC TAAGGTTGCT GATAAGGCGA GTGTGACGGG GATTGCTAAG 
GGGATAAAGG AGATTGTTGA AGCTGCTAGG GGGAGTGAAA AGCTGAAAGT TGCTGCTGCT 
AAAGAGGGCA ATGAAAAGGC AGGGAAGTTG TTTGGGAAGG CTGGTGCTAA TGCTCATGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
TTAAGTGCGA TTGTTAAGGC TGCGGATGCG GCTGAGCAGG ATGGAAAGAA GCCTGCAGAT 
GCTACAAATC CGATTGCTGC TGCTATTGGG AATAAAGATG AGGATGCGGA TTTTGGTGAT 
GGGATGAAGA AGGATGATCA GATTGCTGCT GCTATTGCTT TGAGGGGGAT GGCTAAGGAT 
GGAAAGTTTG CTGTGAAGAA TGATGAGAAA GGGAAGGCTG AGGGGGCTAT TAAGGGAGCT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GGTGCTGCGA AGGCTGCTGA TAAGGATAGT 
GTGAAGGGGA TTGCTAAGGG GATAAAGGAG ATTGTTGAAG CTGCTGGGGG GAGTGAAAAG 
CTGAAAGCTG CTGCTGCTGA AGGGGAGAAT AATAAAAAGG CAGGGAAGTT GTTTGGGAAA 
GTTGATGGTG CTGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGAGCAGGAT 
GGAGAGAAGC CTGAGGATGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGGG 
GATGGTGCGG AGTTTGATCA GGATGAGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGGTAATAA TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGAAGTTAGC GAGTTGTTGG ATAAGCTGGT AACAGCTGTA 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GATGCAATTG GAGAAGTTGT GGATAATGNT 
GCNAAGGNTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTN GGGGGAGTGA AAAGCTGAAA GTTGCTGCTG CTANAGNGGN NAATAATAAA 
GAGGCAGGGA AGTTGTTTGG GAAGGCTGGT GCTGATGCTA ATGGGGACAG TGAGGCTGCT 
AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT AGTGGGGAGC AGATATTAAG TGCGATTGTT 
AAGGCTGCGG CTGCTGGTGC GGCTGATCAG GATGGAGAGA AGCCTGGGGA TGCTAAAAAT 
CCGATTGCTG CTGCTATTGG GAAGGGTAAT GCGGATGATG GTGCGGATTT TGGTGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAAGGA TGAGAAAGGG AAGGCTGAGG GGGCTATTAA GGGAGCTAGC 
GAGTTGTTGG ATAAGCTGGT AAAAGCTGTA AAGACAGCTG AGGGGGCTTC AAGTGGTACT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GCGAAGGCTG CTGATAAGGA TAGTGTGACG 
GGGATTGCTA AGGGGATAAA GGAGATTGTT GAAGCTGCAG GGGGGAGTGA AAAGCTGAAA 
GTTGCTGCTG CTAAAGGGGA GAATAATAAA GGGGCAGGGA AGTTGTTTGG GAAGGCTGGT 
GCTAATGCTC ATGGGGACAG TGAGGCTGCT AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT 
AGTGGGGAAC AGATATTAAG TGCGATTGTT AAGGCTGCTG GTGAGGCTGC TGGTGATCAG 
GAGGGAAAGA AGCCTGAGGA GGCTAAAAAT CCGATTGCTG CTGCTATTGG GGATAAAGAT 
GGGGATGCGG AGTTTAATCA GGATGGGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
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GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGATGGTGG TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGGAGTTAGC GAGTTGTTGG ATAAGCTGGT AAAAGCTGTA 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GCTGCAATTG GAGAAGTTGT GGCTGATGCT 
GCTAAGGTTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGACAGTGA GGCTGCTAGC AAGGCAGCTG GTGCTGTTAG TGCTGTTAGT 
GGGGAGCAGA TATTAAGTGC GATTGTTAAG GCTGCGGCTG CTGGTGCGGC TGAGCAGGAT 
GGAGAGAAGC CTGCAGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTGATGGG 
GATGCGGATT TTGGTGAGGA TGGGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGCTGCAATT GGAGAAGTTG TGGATAATGC TGGTGCTGCG 
AAGGCTGCTG ATAAGGATAG TGTGAAGGGG ATTGCTAAGG GGATAAAGGA GATTGTTGAA 
GCTGCTGGGG GGAGTGAAAA GCTGAAAGCT GCTGCTGCTG AAGGGGAGAA TAATAAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGGT GCTGCTGGGG ACAGTGAGGC TGCTAGCAAG 
GCGGCTGGTG CTGTTAGTGC TGTTAGTGGG GAGCAGATAT TAAGTGCGAT TGTTAAGGCT 
GCGGATGCGG CTGAGCAGGA TGGAAAGAAG CCTGCAGATG CTACAAATCC GATTGCTGCT 
GCTATTGGGA ATAAAGATGA GGATGCGGAT TTTGGTGATG GGATGAAGAA GGATGATCAG 
ATTGCTGCTG CTATTGCTTT GAGGGGGATG GCTAAGGATG GAAAGTTTGC TGTGAAGGGT 
AATAATGAGA AAGGGAAGGC TGAGGGGGCT TCAAGTGGTA CTGATGCAAT TGGAGAAGTT 
GTGGATAATG ATGCGAAGGC TGCTGATAAG GCGAGTGTGA CGGGGATTGC TAAGGGGATA 
AAGGAGATTG TTGAAGCTGC TGGGGGGAGT GAAAAGCTGA AAGCTGTTGC TGCTGCTACA 
AGGGAGAATA ATAAAGAGGC AGGGAAGTTG TTTGGGAAAG TTGATGATGC TCATGCTGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
TTAAGTGCGA TTGTTACGGC TGCGGCTGCT GGTGAGCAGG ATGGAGAGAA GCCTGCAGAG 
GCTACAAATC CGATTGCTGC TGCTATTGGG AAGGGTAATG AGGATGGTGC GGATTTTGGT 
AAGGATGAGA TGAAGAAGGA TGATCAGATT GCTGCTGCTA TTGCTTTGAG GGGGATGGCT 
AAGGATGGAA AGTTTGCTGT GAAGAGTAAT GATGGTGAGA AAGGGAAGGC TGAGGGGGCT 
ATTAAGGAAG TTAGCGAGTT GTTGGATAAG CTGGTAAAAG CTGTAAAGAC AGCTGAGGGG 
GCTTCAAGCG GTACTGATGC AATTGGAGAA GTTGTGGCTA ATGCTGGTGC TGCGAAGGCT 
GCTGATAAGG CGAGTGTGAC GGGGATTGCT AAGGGGATAA AGGAGATTGT TGAAGCTGCT 
GGGGGGAGTA AAAAGCTGAA AGCTGCTGCT GCTGAAGGGG AGAATAATAA AAAGGCAGGG 
AAGTTGTTTG GGAAGGCTGG TGCTGGTGCT GGTGCTAATG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGGTTAG 

t24-l.nt 

TGGTGAGGCTGAGCAGGATGGAGAGAAGCCTGAGGATG 

GGGGATGGTGCGGAGTTTGATCAGGATGAGATGAAGAA 

TGGCTAAGGATGGAAAGTTTGCTGTGAAGGGTAAT^ 

CGAGTTGTTGGATAAGCTGGTAACAGCTGTAAAGACAGCTGAGGGGGCT^ 

GTTGTGGATAATGNTGCNAAGGNTGCTGATAAGGC^ 

AAGCTGCTNGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGC^ 

TGGGAAGGCTGGTGCTGATGCTAATGGGGACAGTGAGGCTGCTAGCA^ 

f24-l.aa 

AGNTVKTAEG ASSGTDAIGE WDNDAKVAD KASVTGIAKG IKEIVEAARG SEKLKVAAAK 
EGNEKAGKLF GKAGANAHGD SEAASKAAGA VSAVSGEQIL SAIVKAADAA EQDGKKPADA 
TNPIAAAIGN KDEDADFGDG MKKDDQIAAA IALRGMAKDG KFAVKNDEKG KAEGAIKGAA 
AIGEWDNAG AAKAADKDSV KGIAKGIKEI VEAAGGSEKL KAAAAEGENN KKAGKLPGKV 
DGAAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAEQDG EKPEDAKNPI AAAIGKGNGD 
GAEFDQDEMK KDDQIAAAIA LRGMAKDGKF AVKGNNEKEK AEGAIKEVSE LLDKLVTAVK 
TAEGASSGTD AIGEWDNXA KXADKASVTG IAKGIKEIVE AAXGSEKLKV AAAXXXNNKE 
AGKLFGKAGA DANGDSEAAS KAAGAVSAVS GEQILSAIVK AAAAGAADQD GEKPGDAKNP 
IAAAIGKGNA DDGADFGDGM KKDDQIAAAI ALRGMAKDGK FAVKKDEKGK AEGAIKGASE 
LLDKLVKAVK TAEGASSGTA AIGEWDNAA KAADKDSVTG IAKGIKEIVE AAGGSEKLKV 
AAAKGENNKG AGKLFGKAGA NAHGDSEAAS KAAGAVSAVS GEQILSAIVK AAGEAAGDQE 
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GKKPEEAKNP IAAAIGDKDG DAEFNQDGMK KDDQIAAAIA 
AEGAIKGVSE LLDKLVKAVK TAEGASSGTA AI GEWADAA 
AAGDSEAASK AAGAVSAVSG EQILSAIVKA AAAGAAEQDG 
ADPGEDGMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE 
AADKDSVKGI AKGIKEIVEA AGGSEKLKAA AAEGENNKKA 
AGAVSAVSGE QILSAIVKAA DAAEQDGKKP AD ATN P I AAA 
AAAIALRGMA KDGKFAVKGN NEKGKAEGAS SGTDAIGEW 
EIVEAAGGSE KLKAVAAATR ENNKEAGKLF GKVDDAHAGD 
SAIVTAAAAG EQDGEKPAEA TNPIAAAIGK GNEDGADFGK 
DGKFAVKSND GEKGKAEGAI KEVSELLDKL VKAVKTAEGA 
DKASVTGIAK GIKEIVEAAG GSKKLKAAAA EGENNKKAGK 
AAGAVSAG 

t24-l.aa 

GEAEQ3X3EKPEDAKNPIAAAIGKGNGDGAEFI*2DE^ 

ELLDKLVTAVKTAEGAS SGTDAI GEWDNXAKXADKASVTGI AKG I KE IVEAAXGS EKLKVAAAXXXNNKEAGKL.F 
GKAGADANGDSEAASK 

f28-2.nt 

TAAAAAGGAA ATATAAATAT TATGCGATTA TGTTTAATAA AAATTTTTAT TATACCTAAT 
TTAGTATTTA GTTCTCTTTT TTTATTTGAA AGTTGTTCTG GTTTTCTATC TAAAAAATCT 
ATAGAACAGT TTGCATTAGC ATTAAAAGAT CATCAAGAAA ATAAAAATAC TACTAATACT 
TCAGTAGATA AAAATAGTAA GGAAATTGAA TCTCCTAAAG ACGTTACATC ATCAAATAAA 
AAAACTTATG ATCCAATCTT ACAAGTAGGT TCTAATCAAC ATATGTCAGA TGATCCTGGT 
GCTAATAATA AAGAATCCCT ACCAAATTCA AGTCCAGCAA TAATACAAAA TGACTCGCAT 
GCTCAAAATA ATGTAAAGAT GGAAGAAAAT AAATCAGCTA CTCCACAACA TGATCCAATT 
GAACAAAGTA ATTTTAAAAA TAGCCTTACT ACAACAAGTA AAACTCCTGC TATTCCTTCA 
GAAGAAGAAA TTAAAGCTAA CTTAGATGAA TTTGCACAAG AAGAGTATGA GCAAACATCT 
CTTTCAGAAA TTAAAAATGC CACGCAAATT GTTAATCATG CTAATCCTGA AAACAAATTA 
AACAATACAC TCCTTGAGTT TGAAAAAGAT TATGAAACTT TATCAAACTT GTTATTCTCT 
AATTTAGACG CATCTCCTTT GAATAGAAAA ATAAAGACTA TTATGCCTAA ATTACAAGAA 
ATGCGTTCTT TTATGGAGCA AGCAACTAAT TCTTGGGTAT CTGCTAAAGG CATGCTAGAT 
GAGGCTAAGG ATAAACTAGC AGAATCTATT TATAAAAGAC TATACAATGG CAATTCATAC 
CGGTTCGGTG GCAGTTTTAA CGGACGTGAT ATGCAACATG CAAAAAATTT AGCATACAGA 
GCTATAGACT TTGCTTCTGC ATGCATTGAA TATACACAAA AAGCTATTGA TTATCTTCAA 
CAGGGAAATT CTTGCAAAAA AGAAATAGAA AATATATTCA AGCTTTAA 

t28-2.nt 

AAAAGATCATGAAGAAAATAAAAATACTACTAATACTTCAGTAGAT 

GACGTTACATCATCAAATAAAAAAACTTATGATO^ 

CTGGTGCTAATAATAAAGAATCCCTACX^AAATTCAAGTC 

TGTAAAGATGGAAGAAAATAAATCAGCTACTCCACAACATGATC 

ACTACAACAAGTAAAACTCXZTGCTATTCCTTC 

AGTATGAGCAAACATCTCTTTCAGAAATTAAAAATGCCACGCAAAl^ 

AAACAATACACTCCTTGAGTTTGAAAAAGATTATGAAACTTT 

CXnTTGAATAGAAAAATAAAGACTATTATGCCTAAATTACAAGAA 

CTTGGGTATCTGCTAAAGGCATGCTAGATGAGG 

TGGCAATTCATACCGGTTCGGTGGCAGTTTTAACGGACGT^ 

ATAGACTTTGCTTCTGCATGCATTGAATATACACAAAAAGCTA 

AAGAAATAGAAAATATATTCAAG 



LRGMAKDGKF 
KVADKASVTG 
EKPAEAKNPI 
GAIKGAAAIG 
GKLFGKVDGA 
I GNKDEDADF 
DNDAKAADKA 
SEAASKAAGA 
DEMKKDDQIA 
S SGTDAI GEV 
LFGKAGAGAG 



AVKDGGEKEK 
IAKGIKEIVE 
AAAIGKGDGD 
EWDNAGAAK 
AGDSEAASKA 
GDGMKKDDQI 
SVTGIAKGIK 
VSAVSGEQIL 
AAIALRGMAK 
VANAGAAKAA 
ANGDSEAASK 



f28-2 .aa 
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KGNINIMRLC LIKIFIIPNL VFSSLFLFES CSGFLSKKSI EQFALALKDH QENKNTTNTS 
VDKNSKEIES PKDVTSSNKK TYDPILQVGS NQHMSDDPGA NNKESLPNSS PAIIQNDSHA 
QNNVKMEENK SATPQHDPIE QSNFKNSLTT TSKTPAIPSE EEIKANLDEF AQEEYEQTSL 
SEIKNATQIV NHANPENKLN NTLLEFEKDY ETLSNLLFSN LDASPLNRKI KTIMPKLQEM 
RSFMEQATNS WVSAKGMLDE AKDKLAESIY KRLYNGNSYR FGGSFNGRDM QHAKNLAYRA 
IDFASACIEY TQKAIDYLQQ GNSCKKEIEN IFKL 

t28-2.aa 

KDHQENKinTNTSVDKNSKEI ESPKDVTSSNKKTYDPI LQVGSNQHMSDDPGANNKESLPNSSPAI IQNDSHAQNN 
VKMEENKSATPQHDPIEQSNFKNSLT^ 

NNTLLEFEKDYETLSNLLFSNLDASPLNRKI SAKGMLDEAKDKLAES I YKRLYN 

GNSYRFGGSFNGRDMQHAKNIiAYRAIDFASACI EYTQKAI DYLQQGNSCKKEIENI FK 

f28-3.nt 

TAGATGAATT TAATTGCTAA ATTATTTATT TTATCCACTT TAGTTTCAAT TCCAAATATC 
CTCTCTTGTA ACCTATATGA TAATCTTGCA GACAACGCTG AGCAGGTTAC AGACATACTA 
GACAACAACA AGTCTTTTAA TACTTTAGGA AGCAGCAATG AGAGTAGAAG TCGCAGGCCT 
AGAAGTACAA ATAATGCTTA TATGAAACAA AACATAGACA AAAATCATTT AGTTGTTGCA 
GATATGCAAA ATGATAATAG TAGCAGCAGT CTTCCCCAAC AAGTTAATAG TGAATCCAGT 
AAAGCTAATG AAGATAGTAA TATTATGAAG GAAATTGAAT CTTCTACAGA AGAGTGCGCT 
AGACTAAGAA AAGATTTAGA AACTATAAAA CAAATACTTG ATAATATAGA AAGCTTGCTT 
AATACAGCTA ATTCTTATTT AGAGAACGCT AGAAAAGCAC CTAAATCTAA TCAAGATAAT 
CAAACCTTAT TGCTTAGCCT GCACCAAGCT ATTGCTAAGG TTAAGAGTAG TCATACTTCT 
TTTATCATTT GTTATAATGA TGCATTTAAT TCCCTGGGAA TAGCTGATAC TGCCTTTAAA 
GATGCAAAGA GAAAGGCAGT TGAGGCATAA 

t28-3.nt 

TTGTAACCTATATGATAATCTTGCAGAC^ 

ACTTOAGGAAGCAGCAATGAGAGTAGAAGTCGCAGGCCTAG 

ACAAAAATCATTTAGTTGTTGCAGATATGCAAAATGATAATAGT^ 

ATCCAGTAAAGCTAATGAAGATAGTAATATTATGAAGGAAATTGAATCTTC 

AAAGATTTAGAAACTATAAAACAAATACTTGATAATATAGAAAGCTTGCT^ 

ACGCTAGAAAAGCACCTAAATCTAATCAAGATAATCAAACCTTATTGCT^ 

TAAGAGTAGTCATACTTCTTTTATC^TTTGTTATAATGATG 

AAAGATGCAAAGAGAAAGGCAGTTGAGGCA 

f28-3.aa 

MNLIAKLFIL STLVSIPNIL SCNLYDNLAD NAEQVTDILD NNKSFNTLGS SNESRSRRPR 
STNNAYMKQN IDKNHLWAD MQNDNSSSSL PQQVNSESSK ANEDSNIMKE IESSTEECAR 
LRKDLETIKQ ILDNIESLLN TANSYLENAR KAPKSNQDNQ TLLLSLHQAI AKVKSSHTSF 
IICYNDAFNS LGIADTAFKD AKRKAVEA 

t28-3.aa 

CNLYDNLADNAEQVTD I LDNNKSFNTLGS SNESRSRRPRSTNNAYMKQNI DKNHL WADMQNDNS SS S L PQQVNS E 

SSKANEDSNIMKEIESSTEECARLRKDLETIKQILDN^ 

KSSHTSFI ICYNDAFNSLGIADTAFKDAKRKAVEA 

f31-2.nt 

TAAAAAAATA AGGAGGTATT AATGAAAAGG AAAAGCAATA TATGTATTTC ACTTCTAGTC 
ACAATATTAT TTGTGTCTTG CAAGTTTTTT GGAAATAAAA GCGCAAGTAA AGAAAAAGAA 
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GAAACTTCTT TTTCTGATAC TGCTAGCAAG ATTAGTAAGT CGGGAACAGC TGCTTCTTCA 
GACAAACAAG AAAAAAATAC AAGTGATGTT ACAGGTGACG CCAAAAAGCA TACTAGTAGC 
CCTTACATGC TTGCTGATGC CCTTATTGTT AGTGATACTA CTAATAGAGA TAGAGATAAG 
CAAGAAAATA AAGATAAATT AAATGAAGAA GATAAAAAAA AGCTTAATGC TTTTTTTAGC 
ACAACTAAAA CATATCAATC TAGCCTAGAT TCCATTTATA ACAAATATAC AGGCTATTAT 
AATACCATTG ATACCTATGG CAGCTGTGAT ACGTATCGCA TTGAGTGTTT TAGTGTAGGA 
CCTTCTGAAA AACGTAAACA AGCTCTTGCT GATCTAGAGA AGTTAAAACT AGACGAAAAG 
TACACTCAGC TTAGCACAAT GTTAAAGAGT GCTGTGCCTA GTTATTACAA AAAAAATTTA 
GATGATTCTA TTGCACAGTA TAAGGAAGCC ATAAAGCAGG CTATTGAAGC TGAAAGTAAA 
ATAGAGACAG TAAAAGACTA TGCAACAGCT CAAAGTGCTG CCGATGACGA AAAGAAAAGA 
AATATAGATA ATTTAAAAAT AGTTAGAGAT GTTCTTCTTA TTATTAAAAA AACTATTGAG 
AAAGCCAGCC GATCTTATGC TGATGCTTTT GCTATTGCAA CATCTAGCTT ATCTTGTAGC 
GAATTTAAGC AAGCTGTTAA AGAGTTTAAT GATGCTGCTA AACAATATGC TAATGGAAAT 
AAAGGAGACA ATGCTGTCAA TGTTATTGTA GGCACTATTT CTAGTATGCC TTATGTCAAA 
TTTAAAGATG AGTTTGCAAG AGCAAAAATG TTTGCTCGTA ATTATAGAGG AGACGAGGTA 
GACAAGATGA TAAGAGCTAT CGACAAGCTG TGTGATGTTT ATAAAAAAGT TGCGCTTTAG 

t31-2.nt 

TTGCAAGTTTTTTGGAAATAAAAGCGCAAGT^ 

AGTAAGTCGGGAACAGCTGCTTCTTCAGACAAACAAGAAAAAAATACAAGTGATG 

ATACTAGTAGCCCTTACATGCTTGCTGATGCCCTTATTGTTAGTGATACT 

AAATAAAGATAAATTAAATGAAGAAGATAAAAAAAAGCTTAATGCTTTTTTO 

AGCCTAGATTCCATTTATAAGAAATATACAGGCTATTATAATACCATTGATACCTATG 

GCATTGAGTGTTTTAGTGTAGGACCTTCTGAA 

CGAAAAGTACACTCAGCTTAGCACAATGTTAAAGAGT^ 

ATTGCACAGTATAAGGAAGCCATAAAGCAGGCTATTGAAGCTGAAAGTAAAATAGAGACAGTAA 

CAGCTGAAAGTGCTGCCX2ATGACGAAAAGAAAAGAAATA 

TATTAAAAAAACTATTGAGAAAGCCAGCCGATC^ 

AGCGAATTTAAGCAAGCTGTTAAAGAGTTTAATGATGCTGCTAA 

CTGTCAATGTTATTGTAGGCACTATT^ 

GTTTGCTCGTAATTATAGAGGAGACGAGGTAGACAAGATGATAAGAGCTATCGAC^G 
f31-2.aa 

KNKEVLMKRK SNICISLLVT ILFVSCKFFG NKSASKEKEE TSFSDTASKI SKSGTAASSD 
KQEKNTSDVT GDAKKHTSSP YMLADALIVS DTTNRDRDKQ ENKDKLNEED KKKLNAFFST 
TKTYQSSLDS IYNKYTGYYN TIDTYGSCDT YRIECFSVGP SEKRKQALAD LEKLKLDEKY 
TQLSTMLKSA VPSYYKKNLD DSIAQYKEAI KQAIBAESKI ETVKDYATAQ SAADDEKKRN 
IDNLKIVRDV LLIIKKTIEK ASRSYADAFA IATSSLSCSE FKQAVKEFND AAKQYANGNK 
GDNAVNVIVG TISSMPYVKF KDEFARAKMF ARNYRGDEVD KMIRAIDKLC DVYKKVAL 

t31-2.aa 

CKFFGNKSASKEKEETSFSDTASKISKSGTAASSDKQEKNTSDVTGDAKK^ 
NKDKLNEEDKIOCLNAFFSTTKTYQSSLDSIYNKY^ 

EKYTQLSTMLKSAVPSYYKKNLDDSIAQYKEAIKQAIEAESKIETVKDYAT 

IKKTI EKASRSYAD AF AI ATS SLSCSEFKQAVKEFNDAAKQYANGNKGDNAVNVT VGTI S SMPYVKFKDEFARAKM 
FARNYRGDEVDKMIRAIDK 

f32-4.nt 

TAAGGAAATA TGAGGAATAT TAGCAATTGT ATCAAATATA TTATATTAAC AATGCTTATT 
GGATTATTAA TTTTTTGTTG TGCAACCTTT GTTTGGTTGA TTGGAATTTT TTATTCAAAT 
AACTTTAAAG AAGAGCGGAA TTATTCAATA AGCCCAATAG ATAGTGTTAT TATGCGTAAA 
TGTTATTTTA AAGAATTTAA GTCTGGACTT ATTAAAAGCG TATTCTTTAA GAAATTAGAT 
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GTAAATGTTA ACTCTAAAAA TTTTAAGGAG CTAAATAAGG TAGATAAACA AAATCTGCTA 
AATTCTTATC CATCTTATCA TATGGAGTTT GTCGTAGTTG ATAATGGATT TTTAATGAAT 
TTTAAAAATG TTATTTTTAA TGGTATAGAT GATGCTAAAT TATACGATCA ACGTGATATG 
GTTTACGGAG GATTTAGATA CTCAAAAGAG GCTTATTTCC AAATTATTGG CAATTATGAT 
GTTAAATTAA ATAAAATGAA ACAATATACT CCAGCAATTG TAGTAAATGT TTTCAAAATT 
AACATTAATG ATGCTTTATT TAACTCGTTA TTAAAGCAAA AAACTTTAAA AGTTACTTTG 
ATTTCCCATA ATAATAAAGA GTATATTTTA CAAACTAATA ATTTCTTATC AAAGTATAAT 
TTTCAAACAC CAGAAAAGGA GAATAGTTCT TACTAA 

t32-4.nt 

AAATAACTTTAAAGAAGAGCGGAATTATTCAATAAGCCCAATAGATAGTGTTATTA 
GAATTTAAGTCTGGACTTATTAAAAGCGTATTCTTTAAGAAATTAGATC 
AGCTAAATAAGGTAGATAAACAAAATCTGCTAAATTCTTATCCATCTTATCATATGGA 
TGGATTTTTAATGAATTTTAAAAATGTTAT^ 

GTTTACGGAGGATTTAGATACTCAAAAGAGGCTTATTTCCAAATTATTGGCAA 
TGAAACAATATACTCCAGCAATTGTAGTAAATGTTTTCAAAAT^ 
AAAGCAAAAAACTTTAAAAGTTACTTTGATTTCCCATAATAATAAAGAG 
TCZAAAGTATAATTTTCAAACACCAGAAAAGGAGAATAGTTCTTAC 

f32-4.aa 

GNMRNISNCI KYIILTMLIG LLIFCCATFV WLIGIFYSNN FKEERNYSIS PIDSVIMRKC 
YFKEFKSGLI KSVFFKKLDV NVNSKNFKEL NKVDKQNLLN SYPSYHMEFV WDNGFLMNF 
KNVIFNGIDD AKLYDQRDMV YGGFRYSKEA YFQIIGNYDV KLNKMKQYTP AIWNVFKIN 
INDALFNSLL KQKTLKVTLI SHNNKEYILQ TNNFLSKYNF QTPEKENSSY 

t32-4.aa 

NNFKEERNYSISPIDSVIMRKCYFKEFKSGLIKSWFKKUDVNVNSKNFKELN^ 
GFLMNFKNVIFNGIDDAKLYI^RD^ 
KQKTLKVTLI SHNNKEYILQTNNFLSKYNFQTPEKENSSY 

f4-15.nt 

TAAATGAGCA AAAAAGTAAT TTTAATATTA CTAGAAATTT 
TCTATAAATA AAGAACAAAA AACCAAAGAA AAAACATCTG 
CAAAATATTG AAAAACAAGA GCCTGAAAAA CAGAAACAAA 
ACGGTATCAA TTCAAACGGT AGAAATAAGG GAATCAAATC 
AAGTACTACA AGCAAGCTTA TCCGATTCAA ACATTCACTC 
GAAAAGGAAT TTCTAAAACC AGAAGATAAA ATCTTGCCCA 
TTGAGCATCT TAATAAATAA AAAATTGTTA GACTTTAAAG 
TCAACTTTAA AAAATTTCAA AGAAATTAAA AATATTGAGA 
TTATTATTTG TCTTAACCCT TAAAGATAAA AATAACAACA 
AATCCCCCAA ACGACATCCA AAAACCCAAA GATTATATTT 
ATTAAAAAGG GTACTGGTGA GAAATACTTA AATCCTATCT 
AAAAAAGATT ATCATTCAAT AGATTACAAC AAAGTGACTA 
TTGGACCTAC TGCCTCACGA ACAAGTCTTT CAAATGAATA 
GACACAATAA CAGACTTAAA TAATCTAAAA TTAGTAATTC 

t4-15.nt 

TTGTGATTTATCTATAAATAAAGAACAAAAAACCAAAGAAAAAACATCTG 
ATTGAAAAACAAGAGCCTGAAAAACAGAAACAAAATGCAGCAAAAATAATC 
AAATAAGGGAATCAAATCAAATTCCAAAAAGCATTGAGAAGTACT 
TCTTGATTTTAGCATCACAAGAGAAAAGGAATTTC 



TGATCTTGTC 
AAAAGCAAGA 
ATGCAGCAAA 
AAATTCCAAA 
TTGATTTTAG 
CACAGGGGAA 
CCCCAGAAAA 
ATTTCTTCCA 
ACACTATTAA 
TAAAAGACCT 
ATAGATTTCA 
TTAGCGAAAA 
AAAATTTCAC 
AAAAAGAATT 



TTGTGATTTA 
ATCTGAAAAA 
AATAATCCCT 
AAGCATTGAG 
CATCACAAGA 
AGTGGAGTCT 
TCCAAAAAGC 
AAATCAAGAC 
CATCATGCTC 
TAAAGACACA 
AATAAAAAAC 
AACAATAGAA 
TAAAATTTTA 
AGTGTAA 
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GAGTCTTTO^GCATCTTAATAAATAAAAAATTG 

AAAATTTCAAAGAAATTAAAAATATTGAGAATTTCTTCCAAAATCAAG 

TAAAAATAACAACAACACTATTAACATCATGCTCAATCCCCCAAACGACATC 

AAAGACCTTAAAGACACAATTAAAAAGGGTACTGGTGAGAAATAC 

ACAAAAAAGATTATCATTCAATAGATTACAACAAAGTGACT^^ 

TCACGAACAAGTCTTTC^VAATGAATAAAAATTTCACTAAA 

f4-15.aa 

MSKKVILILL EILILSCDLS INKEQKTKEK TSEKQESEKQ NIEKQEPEKQ KQNAAKIIPT 
VSIQTVEIRE SNQIPKSIEK YYKQAYPIQT FTLDFSITRE KEFLKPEDKI LPTQGKVESL 
SILINKKLLD FKAPENPKSS TLKNFKEIKN IENFFQNQDL LFVLTLKDKN NNNTINIMLN 
PPNDIQKPKD YILKDLKDTI KKGTGEKYLN PIYRFQIKNK KDYHS IDYNK VTISEKTIEL 
DLLPHEQVFQ MNKNFTKILD TITDLNNLKL VIQKELV 

t4-15.aa 

CDLS INKEQKTKEKTSEKQESEKQNI EKQEPEKQKQNAAKI I PTVS IQTVEI RESNQ I PKS I EKYYKQAYPIQTFT 

LDFSITREKEFLKPEDKILPTQGKWSSLSILINKKL^ 

KNNNNTINIMLNPPNDIQKPKDYILKDLKDM 

HEQVFQMNKNFTK 

f4-50.nt 

TAGAAGGAGG AAAAAATGAA AATTGGAAAG CTAAATTCAA TAGTTATAGC CTTGTTTTTT 
AAACTATTGG TCGCATGTAG TATTGGATTA GTAGAAAGAA CAAATGCAGC TCTTGAATCG 
TCCTCTAAGG ATTTAAAAAA CAAAATTTTA AAAATAAAAA AAGAAGCCAC GGGAAAAGGT 
GTACTTTTTG AAGCTTTTAC AGGTCTTAAA ACCGGTTCCA AGGTAACAAG TGGTGGACTA 
GCCTTAAGAG AAGCAAAAGT ACAAGCCATT GTTGAAACAG GAAAGTTCCT TAAGATAATA 
GAAGAAGAAG CTTTAAAGCT TAAAGAAACT GGAAACAGTG GTCAATTCTT GGCTATGTTT 
GACTTAATGC TTGAGGTTGT AGAATCGCTA GAAGACGTTG GAATAATAGG CTTAAAAGCC 
CGTGTTTTAG AGGAATCTAA AAATAATCCT ATAAACACAG CTGAAAGATT GCTTGCGGCT 
AAAGCTCAAA TAGAAAATCA ACTTAAAGTG GTTAAGGAAA AACAAAATAT TGAAAATGGT 
GGAGAGAAAA AAAATAATAA AAGCAAAAAA AAGAAATAA 

t4-5(Knt 

ATGTAGTATTGGATTAGTAGAAAGAACAAATGCAGCTC 

AAAATAAAAAAAGAAGCCACGGGAAAAGGTGTACTTTTTGAAG^ 

CAAGTGGTGGACTAGCCTTAAGAGAAGCAAAAGTAGAAGCCATTC 

AGAAGAAGCTTTAAAGCTTAAAGAAACTGGAAAC^TGG 

GTAGAATCGCTAGAAGACXSTTGGAATAATAGGCT^ 

ACA(^GCTGAAAGATTGCTTGCGGCTAAAGCTCAAA 

TGAAAATGGTGGAGAGAAAAAAAATAATAAAAGCAAAAAAAAGAAA 

f4-50.aa 

KEEKMKIGKL NSIVIALFFK LLVACSIGLV ERTNAALESS SKDLKNKILK IKKEATGKGV 
LFEAFTGLKT GSKVTSGGLA LREAKVQAIV ETGKFLKIIE EEALKLKETG NSGQFLAMFD 
LMLEWESLE DVGIIGLKAR VLEESKNNPI NTAERLLAAK AQIENQLKW KEKQNIENGG 
EKKNNKSKKK K 



t4-50.aa 
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CSIGLVERTNAALESSSKDLKNKILKIK^ 

EEALKLKETGNSGQFLAMFDLMLEWESLEDVGI IGLKARVLEESKNNPINTAERIiLAAKAQI ENQLKWKEKQNI 
ENGGEKKNNKSKKKK 

f 4-66. nt 

TAATTTTTAA AATTTAAATA TTTACATAAT AGTAATGTGT GTGGGAGACG TATGAAAAAT 
ATTTTATTAT TTGTTATTTT ATTATTCTTT TCTTGTAAAG AATTTAATTA TTCTGATCTT 
AGGAGAAGGC CTTCAAAGGT TTTAAATGCT TCTAATGGTG CATCAAATAA AGAACTTAAA 
ATTTCTTTTG TAGATTCTTT AAATGATGAT CAAAAAGAAG CTTTGTTTTT TCTTGAACAG 
GTAGTTCTTG ATAGCAATCC CGACAAGTTT AATCAAATTT TTAATTTAAA TGAAGAGAAG 
GTAAAAGAAA TGCTTGTTAC TGTTGTTAAG TGTTTAAAGG CCAAAAGAAA GGCTAAAATG 
GCTCTTGAGA GCTCAAATGT TGCAAATGTT GCCAATGCTA AACAGCAATT GCTACAGGTT 
GAAAAAACTT ACATAGATAA TTTGCGACAA TCTTTTATGA CTACTAAAAA CATTGAAGAG 
GCTTGTAATC TTGTAAAAAA TTATGATGCA TCTGCTTCGT TTTAA 

t4-66.nt 

TTGTAAAGAATTTAATTATTCTGATCTTAGGAGAAGGCCTTCAAA^ 

TTCTTGATAGCAATCCCGACAAGTTTAATCAA^ 
TGTTGTTAAGTGTTTAAAGGCCAAAAGAAAGGCTAAAATGGCTCT^ 
GCTAAACAGCAATTGCTACAGGTTGAAAAAACTTACATAGATAATTTO 
TTGAAGAGGCTTGTAATCTTGTAAAAAATTATGATGCATCTGCT^ 

f 4-66 .aa 

FLKFKYLHNS NVCGRRMKNI LLFVTLLFFS CKEFNYSDLR RRPSKVLNAS NGASNKELKI 
SFVDSLNDDQ KEALFFLEQV VLDSNPDKFN QIFNLNEEKV KEMLVTWKC LKAKRKAKMA 
LESSNVANVA NAKQQLLQVE KTYIDNLRQS FMTTKNIEEA CNLVKNYDAS ASF 

t4-66.aa 

CKEFNYSDLRRRPSKVLNASNGASNKELKI SFVDSLNDIX^KEALFFLEQVVLDSNPDKFNQ I FNLNEEKVKEMLVT 
VVKCIJCAKRKAKMALESSNVANVANAKQQLLQVEKTYIDNI^ 

f42-l.nt 

TAATTATTAA AATCTAAGGA GAAGAGATTT ATGAACAAAA AATTTTCTAT TTCATTATTA 
TCTACAATAT TAGCCTTCTT GTTAGTATTA GGTTGTGATT TGTCAAGCAA TAATGCTGAA 
AACAAAATGG ATGATATTTT TAATTTAGAA AAGAAATACA TGGATAATTC AAATTATAAA 
TGTTTAAGTA AAAATGAGGC TATAGTTAAA AATTCTAAAA TTAAATTAGG TGTAAATAAT 
ACTAGAAGTC GTTCTTATTC TTCTAGAGAG ACTAATGTTT CGGATTCCTA TAATAAAACC 
TATTCATATT GCAAAAGCAA CTGA 

t42-l.nt 

TTGTGATTTGTCAAGCAATAATGCTGAAAAC^^ 

TCAAATTATAAATGTTTAAGTAAAAATGAGGCTATAGTTAAAAATTOT 

GAAGTCGTTCTTATTCTTCTAGAGAGACTAATGTTTC 

C 

f42-l.aa 

LLKSKEKRFM NKKFSISLLS TILAFLLVLG CDLSSNNAEN KMDDIFNLEK KYMDNSNYKC 
LSKNEAIVKN SKIKLGVNNT RSRSYSSRET NVSDSYNKTY SYCKSN 
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t42-l.aa 



CDLSSNNAENKMDDI FNLEKKYMDNSNYKCLSKNEAIVKNSKIKLGVNNTRSRSYS SRETNVSDSYNKTYS YCKSN 



f43-3.nt 

TGAATATTAA TAATAAAAAA AGGAATAANA 
TTTTTACTAA TGCTAAACAG CTGTAATTCT 
AGACAAAAAC GTGATTTAAC CCAAAAAGAA 
GACCTGCTTA GAGAAAAGCT ATCTGAAGAC 
GCTTTAACTG GTGCTGGAGA ATTTGATAAA 
GGTGCACTTA ATCATATAAA GAGTGAACTT 
CAAAAAAGCA CCTTCAAAGA GGTGGTTAAG 
GCAACTAGTG CAAGTAGTAC CTGCCAAGCT 

t43-3.nt 



ATGAAAATTA TCAACATATT ATTTTGTTTA 
AATGATACTA ATACTAGCCA AACAAAAAGT 
GCAACACAAG AAAAACCAAA ATCTAAAGAA 
CAAAAAACAC ATCTTGACTG GTTAAAAACC 
TTTTTAGGAT ATGACGAAGA CAAAATAAAA 
GATAAGTGTA CTGGGGATAA TTCTGAACAA 
GGGGCTCTTG GTGGCGGTAT AGATAGTTTT 
CAGCAATAA 



CK3TAATTCTAATGATACTAATACTAGCCAAACAAAAAGTAGACAAAAACGTC 

CAAGAAAAACCAAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTATCTGAAG 

TAAAAACCGCTTTAACTGGTGCTGGAGAATTTGATAAATTTTTAGGATATG 

TAATCATATAAAGAGTGAACTTGATAAGTGTACTGGGG 

GTTAAGGGGGCTCTTGGTGGCGGTATAGATAGTTTTGCAACTAGTGCAAGTA 



f43-3.aa 



ILIIKKGIXM KIINILFCLF LLMLNSCNSN DTOTSQTKSR QKRDLTQKEA TQEKPKSKED 
LLREKLSKDQ KTHLDWLKTA LTGAGEFDKF LGYDEDKIKG ALNHIKSELD KCTGDNSEQQ 
KSTFKEWKG ALGGGIDSFA TSASSTCQAQ Q 



t43-3.aa 



CNSNOTNTSQTKSRQK3U}LTQKEATQEK^ 

NHIKSELDKCTGDNSEQQKSTFKEWKGALGGGIDSFATSASSTCQAQQ 



f45-2.nt 



TAGGAGAGAA TAATTATGAA TAAAAAAACA 
ATTTCTTGCA AGAATTTTGC AACTGGTAAA 
AAAGGATTTG TAAATAAGAT TTTAGATCCA 
AAAGTAGATG AAGTAGCAAA AAAATTACAA 
GATGATCCTA ATGGCAGTGG AATAAATCCG 
AATGCATTAG TATTAAAAGC AATAGAACAA 
GAAGCTGAAG CTAAAGTTGA AGAAAATAAA 
AAAGAAAAAG AAATAATAGA CGAACAAAAC 
GAACAACAAA AAGAACAAAA AAGACATCAA 
AAAGAAAAAA GAGAAAGAGA AGAGGCAGAA 
AAAAGGCAAG TTGATAACCA AATTAAAACA 
AATATTGATG TTATAAAATG GCAAACGACT 
ACTGGGCCTG TGTATGATGA TTTTACCAAT 
GGGTTAGAAG AGGAATCAGA AGACGAAGGA 
GCTAGGGACG CGCTAAGAAC TAAATTAAAT 
GAGCCTAAGT TAAAAGAAAG TGTAAATGTT 
AAATCAAAAT TAGAAGAAGT TAAAAAATAT 
AAAGGATACA TCAGTGACAG TCAGTAA 



TTGATTATTT GTGCTGTTTT TGCGCTGATA 
GATATAAAAC AAAATTCAGA AGGGAAAATT 
GTAAAGGATA AAATTGCTTC AAGTGGTACA 
GAAGAAGAAA AAGAAGAATT AATGCAGGGC 
CCACCAGTAT TGCCGGAAAA TATTCACAAT 
AGTGATGGTC AACAAGAAAA AAAAGTAGAA 
GAAAAACAAG AGAATACAGA AGAAAACATT 
AAACAAGAAT TAGCTAAAGC TAAAGAAGAA 
GAAGAGCAAC AAAGAAAAGC TAAAGCAGAA 
CAACAAAAAC GACAACAAGA AGAGGAAGAA 
CTTATAGCTA AAATAGATGA GATCAATGAA 
GTAGGCCCAC AAGGCGTTAT AGATAGAATT 
GGCAATAATT CTATACGCGA AACTTGGGAG 
TTAGGAAAAT TATTGAAAGA ATTGAGTGAT 
GAAGGCAATA AACCATATAC TGGTTACGAA 
AGCGAAATTA AAGAAGATTT AGAAAAATTA 
CTTAAAGATA GTTCTAAATT TGAAGAAATT 



t45-2.nt 
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TTGCAAGAATTTTGCAA^ 

TTAGATCCAGTAAAGGATAAAATTGCTTCAAGTGGTACAAAAGTAGATGAAGTAGCAAAAA 

AAAAAGAAGAATTAATGCAGGGCGATGATCCTAATGGCAGTGGAATAAATCCGCCACC^ 

TCACAATAATGCATTAGTATTAAAAGCAATAGAACAAAGTGATGGTC^ 

GCTAAAGTTGAAGAAAATAAAGAAAAACAAGAGAATACAGAAGAAAACATTAAA 

AAAACAAACAAGAATTAGCTAAAGCTAAAGAAGAAGAACAACAAAAAGAACAAAAAAGACATCAAG 

AAGAAAAGCTAAAGGAGAAAAAGAAAAAAGAGAAAGAGAAGAGG 

GAAAAAAGGCAAGTTGATAACCAAATTAAAACACTT^^ 

TAAAATGGCAAACGACTGTAGGCCCACAAGGCGTTATAGATAGAATTACTGGGCCTC 

TGGCAATAATTCTATACGCGAAACTTGGGAGGGGTTAGAAGAGGAATCA 

AAAGAATTGAGTGATGCTAGGGACGCGCTAAGAACTAAATT^ 

AGCCTAAGTTAAAAGAAAGTGTAAATGTTAGCGAAATTAAAG 

AGTTAAAAAATATCTTAAAGATAGTTCTAAATTTGAAGAAATTAAAGGATACATC 

f45-2.aa 



ERIIMNKKTL IICAVFALII SCKNFATGKD 
VDEVAKKLQE EEKEELMQGD DPNGSGINPP 
AEAKVEENKE KQENTEENIK EKEIIDEQNK 
EKREREEAEQ QKRQQEEEEK RQVDNQIKTL 
GPVYDDFTNG NNSIRETWEG LEEESEDEGL 
PKLKESVNVS EIKEDLEKLK SKLEEVKKYL 



IKQNSEGKIK GFVNKILDPV KDKIASSGTK 
PVLPENIHNN ALVLKAIEQS DGQQEKKVEE 
QELAKAKEEE QQKEQKRHQE EQQRKAKAEK 
IAKIDEINEN IDVIKWQTTV GPQGVIDRIT 
GKLLKELSDA RDALRTKLNE GNKPYTGYEE 
KDSSKFEEIK GYISDSQ 



t45-2.aa 

CKNFATGKDIKQNSEGKI KGFVNKI LDPVKDKIASSGTKVDEVAKKLQEEEKEELMQGDDPNGSGINPP PVLPENI 
HNNALVLKAIEQSIX3QQEKKVEEAEAKVEENKEKQEOT 

RKAKAEKEKREREEAEQQKRQQEEEEKRQVDNQIKTLIAKIDEINENIDVIKWQTTV 

GNNSIRETWEGLEEESEDEGLGKLLKELSDARDALRTKLNEGNKPYTC 

VKKYLKDSSKFEEIKGYISDSQ 



f47-2.nt 



TGAATATTAA TAATAAAAAA AGGAGTAACA 
TCTTTGCTAC TACTAAATAG CTGTAATTCC 
CAACAAACAA AAAGCAGGAA AAAACGTGAT 
AAAATCACTT TAACATCCGA CGAAGAAAAA 
TACACAATTG AAAAATTAAA CAATGAAATA 
TGTAATGACT TCTTTGATTG GCTTTCTGAA 
GCTTTTACCA AGGTTTACAA CTTCTTAAAA 
TATATTAAAG GAGCTATTGA TTGTAAAAAA 
GAAATATGGG GAGGTGGACA ACTTANTAGN 

t47-2.nt 



ATGAAAATCA TCAACATATT ATTTTGTATA 
AATGATAATG ACACTTTAAA AAACAATGCC 
TTAAGCCAAG AAGAACTGCC ACAACAAGAA 
ATGTTTACTT CATTAATCAA TGTGTTTAAA 
CAAGGGTGCA TGAATGGAAA CAAAAGTAAA 
GATATTCAAA AACAAAAAGA ATTAGCTGGT 
TCAAAAGCAC AAAATGAAAC TTTTGATACT 
AACACTCCAC AAGATTGTAA TAAAAATAAT 
GCAATATTTT AG 



(TOTAATTCCAATGATAATGACACTTTAAAAAACAATGC^ 

CAAGAAGAACTGCCACAACAAGAAAAAATCACTTT^ 

TGTTTAAATACACAATTGAAAAATTAAACAATGAAATACAAGG^ 

CTTCTTTGATTGGCTTTCTGAAGATATT 

CAAAAACAAAAAGAATTAGCTGGTGCTTTTACCAA 

TTGATACTTATATTAAAGGAGCTATTGATTGTAAAAAAAACACTCCACA^ 



f47-2.aa 



ILIIKKGVTM KIINILFCIS LLLLNSCNSN DNDTLKNNAQ QTKSRKKRDL SQEELPQQEK 



WO 98/59071 



PCIYUS98/12718 



201 

TABLE 1. Nucleotide and Amino Acid Sequences 

ITI/TSDEEKM FTSLINVFKY TIEKLNNEIQ GCMNGNKSKC NDFFDWLSED IQKQKELAGA 
FTKVYNFLKS KAQNETFDTY IKGAIDCKKN TPQDCNKNNE IWGGGQLXXA IF 

t47-2.aa 

CNSNDNDTLKNNAQQTKSRKKRDLS 
FFDWLSEDIQKQKEIAGAFTKVYNFLK^ 

f49-2.nt 

TAAATGTTCA AAACAATCAT TAAACAAAAA AATATGAAAA AAATTTCAAG TGCAATTTTA 
TTAACAACTT TCTTTGTTTT TATTAATTGT AAAAGCCAAG TTGCTGATAA GGCGAGTGTG 
ACGGGGATTG CTAAGGGAAT AAAGGAGATT GTTGAAGCTG CTGGGGGGAG TGAAAAGCTG 
AAAGTTGCTG CTGCTGAAGG GGAGAATAAT GAAAAGGCAG GGAAGTTGTT TGGGAAGGCT 
GGTGCTGGTA ATGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGCGCAGGAT 
GGAGAGAAGC CTGGGGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGAG 
GATGGTGCGG AGTTTAAGGA TGAGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGGCGAGTTG TTGGATAAGC TGGTAAAAGC TGTAAAGACA 
GCTGAGGGGG CTTCAAGTGG TACTGCTGCA ATTGGAGAAG TTGTGGCTGA TGATAATGCT 
GCGAAGGTTG CTGATAAGGC GAGTGTGAAG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGGGAGTAA AAAGCTGAAA GTTGCTGCTG CTAAAGAGGG CAATGAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGCT GCTCATGCTG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGTTAGT GGGGAGCAGA TATTAAGTGC GATTGTTAAG 
GCTGCTGGTG CGGCTGCTGG TGATCAGGAG GGAAAGAAGC CTGGGGATGC TAAAAATCCG 
ATTGCTGCTG CTATTGGGAA GGGTGATGCG GAGAATGGTG CGGAGTTTAA TCATGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAGTGG TGGTGGTGAG AAAGGGAAGG CTGAGGGGGC TATTAAGGGA 
GCTGCTGAGT TGTTGGATAA GCTGGTAAAA GCTGTAAAGA CAGCTGAGGG GGCTTCAAGT 
GGTACTGATG CAATTGGAGA AGTTGTGGCT AATGCTGGTG CTGCAAAGGT TGCTGATAAG 
GCGAGTGTGA CGGGGATTGC TAAGGGGATA AAGGAGATTG TTGAAGCTGC TGGGGGGAGT 
GAAAAGCTGA AAGTTGCTGC TGCTACAGGG GAGAGTAATA AAGGGGCAGG GAAGTTGTTT 
GGGAAGGCTG GTGCTGGTGC TAATGCTGGG GACAGTGAGG CTGCTAGCAA GGCGGCTGGT 
GCTGTTAGTG CTGTTAGTGG GGAGCAGATA TTAAGTGCGA TTGTTAAGGC TGCTGATGCG 
GCTGATCAGG AGGGAAAGAA GCCTGGGGAT GCTANAAATC CGATTGCTGC TGCTATTGGG 
AAGGGTNATG NGGAGAATGG TGCGGAGTTT AANNATGANG GATGA 

t49-2.nt 

TTGTAAAAGCCAAGTTGCTGATAAGGCGAGTGTGACGGGGATT^ 

GGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTGAAGG 

CTGGTGCTGGTAATGCTGGGGACAGTGAGGCTGCTAGCAAGGCG 

GATATTAAGTGCGATTGTTAAGGCTGCTGGTGAGGCTGCGCAGC^ 

ATTGCTGCTGCTATTGGGAAGGGTAATGAGGATGGTGCGGAGTT^ 

CTGCTGCTATTGCTTTGAGGGGGATGGCTAAGGATGGAAAGTT^ 

GGGGGCTATTAAG 

f49-2.aa 

MFKTIIKQKN MKKISSAILL TTFFVFINCK SQVADKASVT GIAKGIKEIV EAAGGSEKLK 
VAAAEGENNE KAGKLFGKAG AGNAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAAQDG 
EKPGEAKNPI AAAIGKGNED GAEFKDEMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE 
GAIKGAGELL DKLVKAVKTA EGASSGTAAI GEWADDNAA KVADKASVKG IAKGIKEIVE 
AAGGSKKLKV AAAKEGNEKA GKLFGKVDAA HAGDSEAASK AAGAVSAVSG EQILSATVKA 
AGAAAGDQEG KKPGDAKNPI AAAIGKGDAE NGAEFNHDGM KKDDQIAAAI ALRGMAKDGK 
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FAVKSGGGEK GKAEGAIKGA AELLDKLVKA VKTABGASSG TDAIGEWAN AGAAKVADKA 
SVTGIAKGIK EIVEAAGGSE KLKVAAATGE SNKGAGKLFG KAGAGANAGD SEAASKAAGA 
VSAVSGEQIL SAIVKAADAA DQEGKKPGDA XNPIAAAIGK GXXENGAEFX XXG 

t49-2.aa 

CKSQVADKASVTGIAKGIKEIVEAAGGSEKLK^^ 

ILSAIVKAAGEAAQDGEKPGEAKNPIAAAIGKGNEIX5AEFKDEMKKDDQIAAAIALRGM 
GAIK 

f5-14.nt 

TAGAAATTCA AAACAAAGGA GAAAACAAAA AGTATGAATA AAAAAATATT GATTATTTTT 
GCTGTTTTTG CACTTATAAT TTCTTGTAAA AATTATGCAA CTGGTAAAGA TATAAAACAA 
AATGCAAAAG GGAAAATTAA AGGATTTTTA GATAAGGTTT TAGATCCAGC AAAAGATAAA 
ATTACTTCAA GTAGTTCAAA AGTAGATGAA TTAGCAAAAA AATTACAAGA AGAAGATGAA 
GATAATGAAT TAATGCAGGG CGATGATCCT AATAACAGAG CAATAGCACT GTTACCAGTA 
TTGCCGGAAA ATAGTCATGA CAATCCACCA GTACCAAAAG TAAAAGCAGC AGCACAAAGT 
GGTGGTCAAC AAGAAGACCA AAAAGCAAAA GAATCTAAAG ATAAAGTTGA GGAAGAAAAA 
GAAGTTGTAG AGGAGAAAAA AGAAGAACAA GATAGTAAAA AAGAAAAAGT GGAGAAGCAA 
AGTCAAAAGC AAAAAGAAGA AGAGAGAAAC TCTAAAGAAG AACAACAAAA ACAAGAAGAA 
GCAAAAGCTA GAGCAGATAG AGAAAGAGAA GAACGACTAA AACAACAAGA ACAAAAAAGA 
CAACAGGAAG AAGCTAGGGT TAAAGCAGAA AAAGAAAAAC AAGAAAGAGA GGAACAACAA 
AAACAAGAAG AAGAAAAGAA AGTTAAATAT AAAATTAAAA CACTTACAGA CAAAATAGAT 
GAAATAAATA AGGATATTGA TGGTATAAAT GGTAAAACAA TTGTAGGAGC AGAAGAAGTT 
ATAGATAAAA TTACGGGGCC TGTATATGAT GATTTTACTG ATGGGAATAA AGCTATATAC 
AAAACTTGGG GAGATTTAGA GGATGAAGAA GGCGAAGAAT TAGGAAAATT ATTGAAAGAA 
TTGAGTGATA CTAGACATAA TTTAAGAACC AAATTAAATG AGGGTAATAA AGCATATATT 
GTTCTAGAAA AGGAGCCTAA TTTAAAAGAA AATGTAAATG TTAGTGATAT TCAATCAGAT 
TTAGAAAAAT TAAAATCAGG ATTAGAAGAA GTTAAAAAAT ATTTTGAAAA TGAAGATAAT 
TTTGAAGAAA TTAAAGGATA CATTGAGGAT AGTAATTCAT ATTGA 

t5-14.nt 

TTGTAAAAATTATGCAACTGGTAAAGATATAAAACAAAATGCAAAAGGGAAAA 
TTAGATCCAGCAAAAGATAAAATTACTTCAAGTAGTTCAAT^GTAGATC 
ATGAAGATAATGAATTAATGCAGGGCGATGATCCTAATAACAGAGCAAT^ 
TAGTCATGACAATCCACCAGTACCAAAAGTAAAAGCAGCAGCA 
*AAAGAATCTAAAGATAAAGTTGAGGAAGAAAAAGAAGTTGTA 
AAAAAGTGGAGAAGCAAAGTGAAAAGCAAAAAGAAGAAGAGAGAAJ^ 
AGCAAAAGCTAGAGCAGATAGAGAAAGAGAAGAACGACTAAAACAACAAGAACAAAAAAG^ 
AGGGTTAAAGCAGAAAAAGAAAAACAAGAAAGAGAGGAACAAC^^ 
AAATTAAAACACTTACAGACAAAATAGATGAAATAAATAAGGA 
AGCAGAAGAAGTTATAGATAAAATTACGGGGCCTGTATATGATGATTTTA 
ACTTGGGGAGATTTAGAGGATGAAGAAGGCGAAGAATTAGGAAAATT^ 
ATTTAAGAACCAAATTAAATGAGGGTAATAAAGCATATATTGTTCTAG^ 
AAATGTTAGTGATATTCAATCAGATTTAGAAAAATTA 
GAAGATAATTTTGAAGAAATTAAAGGATACATTGAGGATAGTAATTCA 

f5-14.aa 

KFKTKEKTKS MNKKILIIFA VFALIISCKN YATGKDIKQN AKGKIKGFLD KVLDPAKDKI 
TSSSSKVDEL AKKLQEEDED NELMQGDDPN NRAIALLPVL PENSHDNPPV PKVKAAAQSG 
GQQEDQKAKE SKDKVEEEKE WEEKKEEQD SKKEKVEKQS QKQKEEERNS KEEQQKQEEA 
KARADREREE RLKQQEQKRQ QEEARVKAEK EKQEREEQQK QEEEKKVKYK IKTLTDKIDE 
INKDIDGING KTIVGAEEVI DKITGPVYDD FTDGNKAIYK TWGDLEDEEG EELGKLLKEL 
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SDTRHNLRTK LNEGNKAYIV LEKEPNLKEN VNVSDIQSDL EKLKSGLEEV KKYFENEDNF 
EEIKGYIEDS NSY 



t5-14.aa 



CKNYATGKDI KQNAKGKIKGFLDKVLDPAKDKITSS SSKVDELAKKLQ^ 

SHDNPPVPKVKAAAQSGGQQEDQKAKESKDKVEEEKEVVEEKKEEQDSKKEK^ 

AKARADREREERLKQQEQKRQQEEARVKMKEKQEREEQQKQEEEKKVKYKIKTLTDKI 

AEEVIDKITGPVYDDFTIX3NKAIYKTWGDLEDEEGEELGKLLKELSOT 

NVSDIQSDLEKLKSGLEEVKKYFENEDNFEEIKGYIEDSNSY 

f5-15.nt 



TAACTTATGA ATAAGAAAAT GAAAATGTTT 
TCTTGCAAGA ATTATGCAAG TGGTGAAAAT 
TCAGAACAAA ATGTAAAAAA AACAGAACAA 
GAAATTCTAG AGACAAAAGA TTTATCTAAA 
AAACAAATTC AAGAATTAAA GAATAAAATA 
GAAACATATT CTGAGTATGA AGAAAAAATA 
GGACTTGAAG ATAAATTTAA GGAGCTTGAA 
AAAAAAGCTT TACAAGAGGC CAAACAGAAA 
TCAACTGGGA AAACTCAAGG CGACAGGTCT 
TGGCAGTGTG CCAATGAATT AGGTTTGGGT 
AGCAATACTG ATGAATTAGC AAACAAAGTT 
GAACTTAAGG GAATAGAAGA AGATAAAAAA 



ATTATTTGTG CTGTTTTTGC ATTGATGATT 
CTAAAAAATT CAGAACAAAA TCTAGAAAGT 
GAGATAAAAA AACAAGTTGA AGGATTTTTA 
TTAGATGAAA AAGATACAAA AGAAATTGAA 
GAAAAATTAG ATTCTAAAAA AACTTCTATT 
AACAAAATAA AAGAAAAATT GAAAGGAAAA 
GAGAGTTTAG CAAAGAAAAA GGGGGAGAGA 
TTTGAAGAAT ATAAAAAACA AGTAGATACT 
AAAAACCGAG GTGGTGTTGG AGTGCAAGCT 
GTAAGTTATT CTAATGGCGG CAGTGACAAC 
ATAGATGATT CTCTTAAAAA GATTGAAGAA 
GAATAA 



t5-15.nt 



TTGCAAGAATTATGCAAGTGGTGAAAATCTAAAAAATTCAGAACAAAATC 

AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTCTAGAGACA 

ATGAAAAAGATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAA 

AACTTCTATTGAAACATATTCTGAGTATGAAGAAAAA 

GAAGATAAATTTAAGGAGCTTGAAGAGAGTTTAGCAAAGAAA 

AACAGAAATTTGAAGAATATAAAAAACAAGTAGATACTTCAACTGGGAAAAOT 

AGGTGGTGTTGGAGTGCAAGCTTGGCAGTGTGCCAATGAATO 

GACAACAGCAATACTGATGAATTAGCAAACAAAGTTATAGATO 

GAATAGAAGAAGATAAAAAAGAA 



f5-15.aa 



LMNKKMKMFI ICAVFALMIS CKNYASGENL 
ILETKDLSKL DEKDTKEIEK QIQELKNKIE 
LEDKFKELEE SLAKKKGERK KALQEAKQKF 
QCANELGLGV SYSNGGSDNS NTDELANKVI 



KNSEQNLESS EQNVKKTEQE IKKQVEGFLE 
KLDSKKTSIE TYSEYEEKIN KIKEKLKGKG 
EEYKKQVDTS TGKTQGDRSK NRGGVGVQAW 
DDSLKKIEEE LKGIEEDKKE 



t5-15.aa 



CKNYASGENLKNSEQNLESSEQNVKKTEQEIKKQVEGFLEI^ 

TSIETYSEYEEKINKIKEKLKGKGLEDKFTCELEESLAKKKGERKKAIjQEAKQKFEEYK^ 
CXSVGVQAWQCANELGLGVSYSNGGSDNS^ 

f51-2.nt 

TAATTGTTTG GGGTTGTGGT AAACTTAAGG CTTATGGAGT GGATTATGAA TAAAAAAATG 
AAAATATTTA TTATTTGTGC TGTATTTGTG CTGATAAGTT CTTGCAAGAT TGATGCAACT 
GGTAAAGATG CAACTGGTAA AGATGCAACT GGTAAAGATG CAACTGGTAA AGATGCAACT 
GGTAAAAATG CAGAACAAAA TATAAAAGGG AAAGTTCAAG GATTTTTAGA AAAGATTTTA 
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GATCCAGTAA AGGATAAAAT TGCTTCAAAT GGTCCAATAG CAGATGAATT GGCAAAAAAA 
TTACAAGAAG AAGAAAAGGT AAATAACGGG GAAGAAGAAA ATGATAAAGC TGTCTTTTTA 
GGAGAAGAAT CAAAAGAGGA TGAAGAAGAA AATGAGCAAG CTGTTAATTT AGAAGAAAAA 
AATGCGGAAG AGGATAAGAA AGTTGTTAAT TTAGAAGAGA AAGAATTAGA AGTTAAAAAA 
GAGACTGAAG AAGATGAAGA TAAAGAAGAA ATAGAGAAAC AAAAACAAGA AGTGGAAAAA 
GCACAAGAAA GAAAACAACG ACAAGAAGAA AAGAAACGAA AAAAACAAGA ACAGCAAGAA 
GAAAAGAAAC GAAAACGACA AGAACAAAGA AAAGAAAGGA GAGCTAAAAA CAAAATTAAA 
AAACTTGCGG ATAAAATAGA TGAGATAAGT TGGAATATTG ATGGTATAGA AAGTCAAACA 
AGTGTAAAAC CGAAAGCAGT TATAGATAAA ATTACGGGGC CTGTATATGA TTATTTTACC 
GATGACAACA AAAAAGCTAT ATATAAAACA TGGGGAGATT TAGAAGATGA AGAAGGCGAA 
GGATTGGGAA AATTATTGAA AGAATTGAGT GATACTAGAG ATGAGTTAAG AACCAAATTA 
AATAAAGATA ATAAAAAATA TTATGCCCAT GAAAATGAGC CTCCTCTAAA AGAAAATGTA 
GATGTCAGCG AAATTAAAGA AGATTTAGAA AAAGTAAAAT CAGGATTAGA AAAGGTTAAA 
GAATATCTTA AAGACAATTC TAAATTTGAA GAAATTAAAG GATACATCAG TT ACAGTCAG 
TAA 

t51-2.nt 

TTGCAAGATTGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTC 

ACTGGTAAAAATGCAGAACAAAATATAAAAGGGAAAGTTCAAGGATTTTTAG 

ATAAAATTGCTTCAAATGGTCCAATAGCAGATGAATTGGCAAAAAAATO 

GGAAGAAGAAAATGATAAAGCTGTCTTTTTAGGAGAAGAATCAAAAGAGGATC 

AATTTAGAAGAAAAAAATGCGGAAGAGGATAAGAAAGTTGTTA^ 

AGACTGAAGAAGATGAAGATAAAGAAGAAATAGAGAAACAAAAACAAGA 

ACGACAAGAAGAAAAGAAACGAAAAAAACAAGAACAGCAAGAAGAAAAGAAACGAAAACGACAAGAAC 

GAAAGGAGAGCTAAAAACAAAATTAAAAAACTTGCGGA^ 

AAAGTCAAACAAGTGTAAAACCGAAAGGAGTTATAGATAAAATTAC 

CAACAAAAAAGCTATATATAAAACATGGGGAGATTTAGAAGATGAAGAAGGCGAAG 

GAATTGAGTGATACTAGAGATGAGTTAAGAACCAAATTAAATAAAGATAATAAAAAATATTA 

AGCCTCCTCTAAAAGAAAATGTAGATGTCAGCGAAATTAAAGAAG^ 

GGTTAAAGAATATCTTAAAGACAATTCTAAATTTGAAGAAA 

f51-2.aa 

LFGVWNLRL MEWIMNKKMK IFIICAVFVL ISSCKIDATG KDATGKDATG KDATGKDATG 
KNAEQNIKGK VQGFLEKILD PVKDKIASNG PIADELAKKL QEEEKVNNGE EENDKAVFLG 
EESKEDEEEN EQAVNLEEKN AEEDKKWNL EEKELEVKKE TEEDEDKEEI EKQKQEVEKA 
QERKQRQEEK KRKKQEQQEE KKRKRQEQRK ERRAKNKIKK LADKIDEISW NIDGIESQTS 
VKPKAVIDKI TGPVYDYFTD DNKKAIYKTW GDLEDEEGEG LGKLLKELSD TRDELRTKLN 
KDNKKYYAHE NEPPLKENVD VSEIKEDLEK VKSGLEKVKE YLKDNSKFEE IKGYISYSQ 

t51-2.aa 

CKIDATGKDATGKDATGIODATGKDATGKNAEQNIKGKVQGFLEKII^ 

EEENDKAWLGEESKEDEEENEQAVNLEEKNAEEDKKVVNLEEKELEV^ 

RQEEKKRKKQEQQEEKKRKRQEQRKERRAKNKIKKLADKIDEISWNIIX3 

NKKAJYKTWGDLEDEEGEGLGKLLKELSOT 

VKEYLKDNSKFEEIKGYISYSQ 

f 6-21. nt 

TAGGCAAAAT TTAAATTTAT AAAAACTTGT AAGGATGCTT GTATGAAAAT ATTGATAAAA 
AAGTTAAAAG TTGTATTATT TCTCAATTTA ATTTTACTTA TTTCTTGTGT TAATGAAAGT 
AATAGAAACA AATTGGTTTT TAAGCTAAAT ATTGGAAGTG AGCCTGCTAC TTTAGATGCT 
CAATTAATAA ACGATACGGT TGGATCAGGG ATTGTAAGCC AAATGTTTCT TGGCATTTTA 
GATGGAGATC CCAGGACTGG AGGATACAGA CCGGGACTTG CTAAAAGTTG GGATATTTCT 
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GATGACGGAG TAGTTTATAC GTTTCATTTA AGAGATAATC TTGTTTGGAG TGATGGAGTT 
TCCATTACTG CCGAAGAATA A 

t6-21.nt 

TTGTGTTAATGAAAGTAATAGAAACAAATTGGTTTTTAAGCTAAATATO 
CAATTAATAAACGATACGGTTGGATCAGGGATTGT^^ 
CTGGAGGATACAGACCGGGACTTGCTAAAAGTTGGGATATTTCTC 
AGATAATCTTGTTTGGAGTGATGGAGTTTCCATTACTC 

f6-21.aa 

AKFKFIKTCK DACMKILIKK LKWLFLNLI LLISCVNESN RNKLVFKLNI GSEPATLDAQ 
LINDTVGSGI VSQMFLGILD GDPRTGGYRP GLAKSWDISD DGWYTFHLR DNLVWSDGVS 
ITAEE 

t6-21.aa 

CVNESNIUnCLVFKLNIGSEPATLDAQLINiyrVGSGIVSQM 
DNLVWSDGVS ITAEE 

f6-27.nt 

TAAAGAAAAG CTTGCATAAA AAGTATAACA AATTCTTTAA 
ATAATTATTG CACTAAAATT AAATTTATAC AGTTATATAG 
ATGAAATACC TTAAAAACAT TTCCTTATTT TTGTTAATTT 
AATGGTAATT TCAATCTACA CGATACAAAC CATAAATTAG 
GACTCGATAA TAAGCAGAAA TTATGATAAT AAAATATCCA 
TTAACAGAAA AAGAAAATTT TAAAGTCAAT ATTTTCATCA 
GATCCTGAAA ATATTTTGAT AAATGAAGAA AAAATTAATT 
CTCAAAGTAA AATCTAGCTT TAATAAAAGC ATTATCAGTA 
GATCTATTAA CCTACATTTA CGATAAAAGC ACAGGGAAAT 
GACAATTGGA ACGTATCGCA CAGTATAAAA TTTAATAAGG 
ACAGATTTTG ATAAAGAAAT TAAAATATCT AAAAATATTT 
AGAAAAATTG AAATTGAAAA AACAGAGCTT AAAACAGAAT 
TACATCTACA GTATGAAAAT TCCAAAATTA TTTGAAAAAT 
TACGAAACAT TTGTTATAGC AAATTATTAC CCCTGTGAAA 
AATTTAAGCT TATACTCTGA TAAATTACGC TTTCTAAACT 
AGAAAATTAA AAATGGAGCC TCCTGTGAGA GCCTTAAAGA 
ACATTAAATA TAGTATTAAG TCCTCAAAAA ATAATAGAGC 
GATATTACTC TAAAATTAAA ATCTTACGGA GAAAAGGGAG 
AAACCACTTC TTTTAAAATT CTTAAAAGAA GTAGATCATT 
AGTAGGCATA AATTTTAA 

t6-27.nt 

TTGGAAATCCATCCCAAATGGTAATTTCAA 

GACTCGATAATAAGCAGAAATTATGATAATAAAATATCCATTGTGGGAG 
ATTTTAAAGTCAATATTTTCATCAAAAAAAAAGGATO 
AATTAATTATTCAAAATATAAAGCAGAACTCAAAGTAAAATC 
ACTAATTCAAGAGATCTATTAACCTACATT^ 

ATTGGAACGTATO^ACAGTATAAAATTTAATAAGGAGTATATTTTAGC^ 

TAAAATATCTAAAAATATTTTGCAAAAACGTATTGATAATAGAAAAAra 

GAATATAATGAAATAG&GGATTATTACATC 

CTGAAACTTACGAAACATTTGTTATAGCAAATTATTACCCCTGT^ 
CITATACTCTGATAAATTACGCTTTCTAAACTCTATTTATC 



TAATTAAAAT 
AATCACTTAA 
TAGGTTGCAA 
GAAAACTAAA 
TTGTGGGAGT 
AAAAAAAAGG 
ATTCAAAATA 
TTTCACTAAC 
ACATTAACAT 
AGTATATTTT 
TGCAAAAACG 
ATAATGAAAT 
CAGACGCTCC 
ATTTAAATAT 
CTATTTATGA 
ATTCAAAAAC 
TAGCAAAAAA 
AATTCACATT 
GCATAAAAAA 



CAAAAAGAAT 
GGAACAAAAA 
ATCCATCCCA 
ATTTCAAGAA 
ATACAACCCT 
ATTACAAATA 
TAAAGCAGAA 
TAATTCAAGA 
TGACTTTAAG 
AGCATATATA 
TATTGATAAT 
AGAGGATTAT 
CTCTGAAACT 
ACTGTTTTTG 
TGAGAATGAT 
AATAAAAGAA 
CATTGAAAAA 
TGAAATATAT 
TTTGCAATCA 
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GTGAGACXTCTTAAAGAATTC 

TAGCAAAAAACATTGAAAAAGATATTACTCTAAAATTAAAATCTTACGGAGAAAAGGGAGA 

ATATAAACCACTTCTTTTAAAATTCTTAAAAGAAGTAGATCATTGCA 

TTT 

f6-27.aa 

RKACIKSITN SLIIKIKKNI IIALKLNLYS YIESLKEQKM KYLKNISLFL LILGCKSIPN 
GNFNLHDTNH KLGKLKFQED SIISRNYDNK ISIVGVYNPL TEKENFKVNI FIKKKGLQID 
PENILINEEK INYSKYKAEL KVKSSFNKSI ISISLTNSRD LLTYIYDKST GKYINIDFKD 
NWNVSHSIKF NKEYILAYIT DFDKEIKISK NILQKRIDNR KIEIEKTELK TEYNEIEDYY 
IYSMKIPKLF EKSDAPSETY ETFVIANYYP CENLNILFLN LSLYSDKLRF LNS I YDENDR 
KLKMEPPVRA LKNSKTIKET LNIVLSPQKI IELAKNIEKD ITLKLKSYGE KGEFTFEIYK 
PLLLKFLKEV DHCIKNLQSS RHKF 

t6-27.aa 

CKS IPNGNFNLHOTNHKLGKLKFQEDS 1 1 SRNYDNKI S I VGVYNPLTEKENFKVNIF IKKKGLQIDPENILINEEK 

INYSKYKAELKVKSSFNKSIISISLTNSRDLLTYIYDKSTGKYINIDFKDNWW 

KISKNILQKRIDNRKIEIEKTELKTEYNEIEDYYIYSMKIP 

LYSDKLRFLNS I YDENDRKLKMEPPVRAL EKDITLKLKSYGEKGEFTFEI 
YKPLLLKFLKEVDHCI KNLQS SRHKF 

f6-5.nt 

TAAATGAAGA AGTTTTTAAT ATCCGTTTAT TTTTTATTGT TTTATGGTTG TTCAACTATA 
TCTTTGGTAA AAATACCAGA AAAAGATAAA ATAAATTTAA CTGTTTTATC ATCTTTAATG 
AATTATCCTG ATTTGAAGAT TTCAAATTTT AAAATAAAAG ACTACGAACA TTTGCATTAT 
TCATCTGATT TTGAAAGCTT GAGTGATACT AAAAATAGTG CTTATATTTA CGTTGATGAA 
TCTAGTTTCA ATAATAATAT TAATTTTATT AAAGATCTTT TTATTTATAA TAAGAAATTA 
TATAGAATAC TTATTGCTTA TAGCTTGACC CAAGGTGCAT CTTTTAAGGC AGAAGTTTTA 
TCTTATCTTG AAAAACAAAA AATTATGAAA AATTTTTCAT TGAAAATAAA TTTTCCAACT 
GCTAAAAAAT TTATGGATAA TAAGTATTGG ATTGTAATTG CAAAAAACCA TTTAGATTCT 
CTTGTTAAGA GTAAAAATTA TTTAGTCTTG GCGAATGTAA AGATGGAATA TATACTCAAA 
AAGTTTTTAA CTTGA 

t6-5.nt 

TTGTTCAACTATATCTTTGGTAAAAATACCAGAAAAAGATAAAA 
TATCCTGATTTGAAGATTTCAAATTTTAAAATAAAAGACTACGA^ 
TGAGTGATACTAAAAATAGTGCTTATATTTACGTTGATGAATCTA^ 
TCTTTTTATTTATAATAAGAAATTA 

GAAGTTTTATCTTATCTTGAAAAACAAAAAATTATGAAA 
AATTTATGGATAATAAGTATTGGATTGTAATTGCAAAAAACCATTT 

f6-5.aa 

MKKFLISVYF LLFYGCSTIS LVKIPEKDKI NLTVLSSLMN YPDLKISNFK IKDYEHLHYS 
SDFESLSDTK NSAYIYVDES SFNNNINFIK DLFIYNKKLY RILIAYSLTQ GASFKAEVLS 
YLEKQKIMKN FSLKINFPTA KKFMDNKYWI VIAKNHLDSL VKSKNYLVLA NVKMEYILKK 
FLT 

t6-5.aa 

CSTISLVKIPEKDKINLTVLSSLMNYPDLKISNFKIKDYEH^ 
LFIYNKiCLYRILIAYSLTQGASFKAEVLSYLEKQKI 
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f7-30.nt 

TAGAGACGAA GTCACAAGCA AAATGTTAAA 
ATAATGAAAA ATTTAAAGAC AAAAATTAAT 
TTTCTTTCTT GCGAATCAAT ACCATCACTT 
GATATTGAAA ATTTAATGCT CGATGAAGCA 
GTTTGGCTTT TGACTGTAAA ATCTTATGTG 
GTGTTTGAAA ATTTTGATCC CGTGTTTGGC 
CTAAAAAATC GAATTACCTA CTACAATCGA 
GGGTGTTACA AAAAATACAG CAGAAGATAA 



AGATTTACAA AATCAAGTTC AAGGGGGCAA 
TTTTTAGGGA TATTTTGGCT ACTGTTACTA 
CCCCAAAAAC CAACCCTAAC AAACAAAGAA 
GAACTTTTTA GATACTCAAC CGCACTAAAT 
ATCAAATACT ATCCTAATGA CAAATTTCCT 
GATGAAAATG GAACTAAAGA AACAAATATA 
TACATAGAAA AAACCGAACC GATTGTATTT 



t7-30.nt 

TTGCGAATCAATACCATCACTTCCCCAAAAACCAACCCT^ 
GAAGCAGAACTTTTTAGATACTCAACCGCACTAAATGTT^ 
ATCCTAATGACAAATTTCCTGTGTTTGAAAATTTTGATC 

TATACTAAAAAATCGAATTACCTACTACAATCGATACATAGAAAAAACCGAACCGATTC 
AAATACAGCAGAAGA 



f7-30.aa 



RRSHKQNVKR FTKSSSRGQI MKNLKTKINF LGIFWLLLLF LSCESIPSLP QKPTLTNKED 
IENLMLDEAE LFRYSTALNV WLLTVKSYVT KYYPNDKFPV FENFDPVFGD ENGTKETNIL 
KNRITYYNRY IEKTEPIVFG CYKKYSRR 

t7-30.aa 



CES I PS L PQK PTLTNKEDI ENLMLDEAELFRYSTALNVV7LLTVKS YVT KYYPNDKF PVFENFDP VFGDENGTKETN 
ILKNRITYYNRYIEKTEPIVFGCYKKYSRR 



f76-l.nt 

TGAATATTAA TAATAAAAAA AGGAGTAACA 
TTTTTACTAA TGCTAAACGG CTGTAATTCT 
AGACAAAAGC GTGATTTAAC CCAAAAAGAA 
AAAGAAGACC TGCTTAGAGA AAAGCTATCT 
AAAACCGCTT TAACTGGTGT TGGAAAATTT 
ATTAAATCAG CACTTGAACA TATAAAGACT 
GGAAAAAACA CCTTCAAAAC TACCGTTCAA 
TTTGCAGATC AAGCAACTGC TACCTGCAAT 



ATGAAAATTA TCAACATATT ATTTTGTTTG 
AATGATACAA ATACCAAGCA GACAAAAAGC 
GCAACACAAG AAAAACCTAA ATCTAAATCT 
GATGATCAAA AAACACAACT TGACTGGTTA 
GATAAATTCT TAGAAAATGA TGAAGGCAAA 
GAACTTGATA AATGTAATGG AAATGATGAA 
GGGTTTTTTA GCGGCGGCAA TATAGATAAT 
TAA 



t76-l.nt 



ctgtaattctaatgatacaaataccaagcagacaaaaagcagacaaaagcgtc 

c^gaaaaacotaaatctaaatctaaagaagacctgcttagagaaaagcta 

actggttaaaaaccgctttaactggtgttgg 

agcacttgaacatataaagactgaacttgataaatgtaatggaaatg^ 

gttcaagggttttttagcggcggcaatatagat^ 



f76~l.aa 



ILIIKKGVTM KIINILFCLF LLMLNGCNSN DTNTKQTKSR QKRDLTQKEA TQEKPKSKSK 
EDLLREKLSD DQKTQLDWLK TALTGVGKFD KFLENDEGKI KSALEHIKTE LDKCNGNDEG 
KNTFKTTVQG FFSGGNIDNF ADQATATCN 



t76-l.aa 
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CNSNDTNTKQTKSRQKRDLTQKEATQEKPKSKSKEDLLREK^ 
ALEHIKTEIJ3KOTGNDEGKNTFKTWQGFFSGGNIDNFAIX5ATATCN 

f8-10.nt 



TAAGTAAGGA GAATATTTAT GAAATATAAT 
TTAACTGCTT GCAATCCAGA TTTTAACACA 
ATTTCAAATC AAGATGCAGA TTCTGATAAA 
ATAAATTTAA TAGAAAAAGC GAATGCAGAT 
GAACCTTCGG ATCAATATGG AATGTTGGCT 
CCACGGGAAT TAATATCTGA TACAGGTAGT 
TATAGTATTT TATTAAATGC TATTGAAACT 
ATACTGTCAA TAAAAGTACT AGAAATATTT 
GATGATGTGG TTGTTCACTT AT ATTC C AAA 
AATTTAAAAA GACTTAAAAA TTTGTTTGAA 
AAGATGTCAA AACGTCTTTT ATTGGATTAT 
AACGCCAAGC TTGGATCTTA TGTGGTTGCA 
GAAGCAGAAA GGCTGAAAAG CGAGATAATT 



ACGATTATAA GCATATTTGT TTGTTTGTTT 
AATAAGAAAA GAACTCTAAG TAAGGGGATA 
ATAATAAAAA ATAAATTACT TGATGATTTA 
AGAGAAAAAT ATGTAAAAAA AATGGAAGAA 
GTTTTTGGAG GTATGTATTG GGCAGAATCA 
GAGAGATCTA TTAGGTATAG AAGGCGTGTT 
AATGAATTAA AGAAATTTTC AGAAATTAGA 
AGCCTATTTA ATCTATTTGG AAGTACTCTT 
AAAGATACTC TAGGTAAACT AGATATTTCA 
AAATTATTAT CTATAAAAAC AATCGTTTCA 
CAAAATAATG AAAATTTTAT AAAAACAGAT 
CTTTCCAATC AAATTCAAGA AAAATATAAT 
TTAATATATA CCCTTTAA 



t8-10.nt 



TTGCAATCCAGATTTTAACACAAATAAGAAAA 

GATAAAATAATAAAAAATAAATTACTTGATGATTTAATAAATTTAATAGAAAAAC^ 
ATGTAAAAAAAATGGAAGAAGAACCTTCGGATCAATATC 

ATCACCACGGGAATTAATATCTGATACAGGTAGTGAGAGATCTATTAGGTATAGAAGGCGTC 

TTAAATGCTATTGAAACTAATGAATTAAAGAAATTTTCAGAAATTAGA 

TTAGCCTATTTAATCTATTTGGAAGTACTCTTGATGATGTGGTTC 

TAAACTAGATATTTCAAATTTAAAAAGACTTAAAAA 

AAGATGTCAAAACGTCTTTTATTGGATTATCAAAATAATGAA 

CTTATGTGGTTGCACTTTCCAATCAAAT^ 

f8-10.aa 



VRRIFMKYNT IISIFVCLFL TACNPDFNTN 
NLIEKANADR EKYVKKMEEE PSDQYGMLAV 
SILLNAIETN ELKKFSEIRI LSIKVLEIFS 
LKRLKNLFEK LLSIKTTVSK MSKRLLLDYQ 
AERLKSEIIL IYTL 



KKRTLSKGII SNQDADSDKI IKNKLLDDLI 
FGGMYWAESP RELISDTGSE RSIRYRRRVY 
LFNLFGSTLD DVWHLYSKK DTLGKLDISN 
NNENFIKTDN AKLGSYWAL SNQIQEKYNE 



t8-10.aa 



COT>DFNTNKKRTLSKGIISNQDADSDKIIKN^ 
SPRELISDTGSERSIRYRRRVYSILLNAIETNELKKFSEIRILS^ 

KLDISNLKRLKNLFEKLLS IKTIVSKMSKRLLLDYQNNENF I KTDNAKIiGSYWALSNQIQEKYNEAERLK 



f8-14.nt 



TAATATATAT TCTTGATTAA GGGAAAGGAG 
TATACATTGT TAACGATAGG ATTGATGTCT 
AAAGAGGAAC AAAAAAATAA CAATGATATA 
GCTATTAATA ATTTATATGG AAATAAAAAA 
AAATTGAAAG ACAAGGGTTT AGACGTGACC 
TCCGTAGAAT CTGCGGTGTC TTTAGGAGAA 
TCAATTGAGC ATAATCAAAA AAAAGAGATA 
GAAGAAAAGC AAGCGGATAA AGCAATTAAA 



AGTATTTTTA TGAAAAAAAA AATGTTTTTA 
TGTAATCTAA ATTCTAAATT ATCTGGTAAT 
AAAGAAGCTT TAAATGGCGT TCAAGAAAAT 
GAAAAAAAAG ATTTTATTAA AAATTCGGAA 
ACCCTCCCCT TAGAACCTGT AGTGGCGCCC 
TCTAATAATA GGATTGGTAT ACCAACCATT 
AAAGAAGAGG ATTTTTTCCC TTCTACTGAG 
GATATAGAGA ATCTTATTGG AGAATCTGGA 
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TTTCCCGAGT TAATTGAGAA TGTGTGCTCA CTTAAACATG AATATACTTT AATAAGAAGT 
GATTTTTATG ATGTGATAAC TAAGATTCAG AATAAAAAAA TATCACTAAT GAAAAATTCT 
CATAATAATA GAAATAAAAT AAGGGAACTA GTACAATTGC AAAATAATTT AAAGATAGGA 
GACGAACTTG ATAAAATTAT GGGTTGCATT GATACTGCAG AACAAGAGAT AAGATCTGCC 
GCTTTCTTTT TTGATGAAGC TAAGGAAAGC TTAAAAGAAG GTATTATTAA AAGATTGGAA 
AAAAGTAAAA ATAGGGCAGC ATCACAATTA TCTAAAAAGG CTTTAAATAG AGCAGAGGAT 
GCTTTAAGGT GCTTAGAAAA TTATTCTTCT AAAAAAGGTG AGGCAATAGG AAGAAGAAGC 
TTTATAAAAG AAGTTGTTGA ACAGGCAAAA AATGCTTTAA GTAAGTCTTA A 

t8-14*nt 

TTGTAATCTAAATTCTAAATTATCTGGTAATAAA 
GGCGTTCAAGAAAATGCTATTAATAATTTAT^ 

AATTGAAAGACAAGGGTTTAGACGTGACCACCCTCCCCTTAGAACCTGTAGT^ 

GTCTTTAGGAGAATCTAATAATAGGATTGGTATACCAACCATTTCAATTC 

GAAGAGGATTTTTTCCCTTCTACTGAGGAAGAAAAGCAAGCGG^ 

GAGAATCTGGATTTCCCGAGTTAATTGAGAATGTGTGCTC 

TTATGATGTGATAACTAAGATTCAGAATAAAAAAAT^ 

AGGGAACTAGTAGAATTGCAAAATAATTTAAAG^ 

CAGAACAAGAGATAAGATCTGCCGCTTTCTTTTTTGAT^ 

ATTGGAAAAAAGTAAAAATAGGGCAGCATCACAATTATCTAAA 

TGCTTAGAAAATTATTCTTCTAAAAAAGGTGAGGCAATAGGAA^ 

CAAAAAATGCTTTAAGTAAGTCT 

f8-14.aa 

YIFLIKGKES IFMKKKMFLY TLLTIGLMSC NLNSKLSGNK EEQKNNNDIK EALNGVQENA 
INNLYGNKKE KKDFIKNSEK LKDKGLDVTT LPLEPWAPS VESAVSLGES NNRIGIPTIS 
IEHNQKKEIK EEDFFPSTEE EKQADKAIKD IENLIGESGF PELIENVCSL KHEYTLIRSD 
FYDVTTKIQN KKISLMKNSH NNRNKIRELV QLQNNLKIGD ELDKIMGCID TAEQEIRSAA 
FFFDEAKESL KEGIIKRLEK SKNRAASQLS KKALNRAEDA LRCLENYSSK KGEAIGRRSF 
IKEWEQAKN ALSKS 

t8~14.aa 

CNLNSKLSGNKEEQKNNNDIKEALNGVQENM 

SLGESNNRIGIPTISIEHNQKKEIKEEDFFPSTEEBKQADKAJKDIENLIGESGFPELIENVCSLra 

YD\TETKIQNKKISLMKNSHNNRNKIRELVQLQNNIjKIGDELDKIMGCIOT 

LEKSKNRAASQLSKKALNRAEDALRCLENYS 

fOlA.nt BB001 

TGATTAATTTTTTTTAAGGATTACGTTT^^ 
TACTGTTCTCATGCTCTTTTTATTCTAAATC 

ACTTGGAAAAATTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACC 

CAGTTCTTTAAAAATGAAAAAGAAAAAATAATTAAAAAAAT^ 

ATAAAATAGGTCCAAATATCGAAATGTTTGCTCAAA 

ATTTGGAATAAATAAAACTTTATTCACAGAAAAAAAAGACA^ 

AGAAGATTATTTTACTCATCTTTAAATTATGATGAA^ 

CAAGCTCAAACGACTACCATTACACACTTATTGGTTTAATTT^ 

AAGCGCTGTTAATATTTTAACTAAAGACGAGCAA 

ATTCAGGAAAATTTTGAAAAACTAATGCAAGAGAGAAA 

ATGACAAAAATACGGGAGGATGCAAAGCTGATGGAAAAATTCT^ 

ACTCGACTCAAATAAAAGTATGCAAATTTT^ 

TACTAA 
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tOlA.nt BB001 
TGCTCTTTTTATTCTAAATCAAACAACAG 

TTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTO 

AAATGAAAAAGAAAAAATAATTAAAAAAATTGCAGAAGAATTTGA 

CCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAA 

ATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATC 

TTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATAC 

GACTACCATTACAGACTTATTGGTTTAATTTTTTGGAC^ 

ATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTA 

TTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATA 

ACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGG^ 
ATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTG 

fOlA.aa BB001 

LIFFKDYVLKRinCIWKTLKLFQITLLFSCSFYSKSNOT 
FFKNEKEKIIKKIAQEFDENEKLINKIGPNIEMFAQTINTDIQ 

RLFYSSLNYDENKIKKLATI LAQTS S SNDYHYTL IGLIFWTGFKIQEAFESAVNILTKDEQKRLI FNFRTKTVKEI 
QENFEKLMQERNSWIKIVDNI IGEYDKNTGGCKADGKIIjGEVIRVGYEHEtiDSNKSMQILNNI ETPLKTCCDHIHY 

tOlA.aa BB0O1 

CSFYSKSNNTEAISELQSSPIKIjGKIKVLQ^^ 

PNIEMFAQTINTDIQKI EPNDQFGINKTLFTEKKDNNIDFMLKDNRLRRLFY^ LAQTSSSN 

DYHYTLIGLIFWTGFKIQEAFESAVNILTKDEQKRLI 

TGGCKAIX5KILGEVIRVGYEHELDSNKSMQILNNIETPLKTCCDHIHY 

f02A.nt BB002 

TAATTAATACTGGTTTTAATTTATAAGGAGA 

TTACAATGATATTAACTTTAATTTGCATCTCATGTGCACCTTO 

CAACKTTTAAAAAAAACACGAGACTGAAAAAAC^ 

GGAGACCTTGGCGCTTCTGATGAAAAATTTATGGGAAC 

AAGATCGAAAAAATG^TACGATATACAAATAGCCAAAATTACTAATC 

TCGGGCTTATGAACTAGCTAACGAAAATGAAAAAATGCn^ 

AAAGAAAACATAGAGACATTAAAAGAAATTCTTGAAAAACT 

GAAATTTCCTTTATCGCATAGCGCTGGATAT^ 

GGACACTCTAAGCAAAGAAAATTCAAAAGAAGATTT 

CAAGAAAAGTTTAAAAAAACCCTAAACAAAACTCTTGA^ 

AAGTACTAGCAGAACACTTTAATAAATATTACAAAGACTCTGA r ^^ 

t02A.nt BB002 

TGTGCACCTTTTAACAAAATCAATCCCAAGGCAAA 

ccc£caatccaggggaaaacatccaaaattttaaa 

gggaactaccxx:ttcagagctaaaagcaattgg 

gccaaaattactaatgaagaatctaacctattagatacttatattcgggctt 

aaatgcttttaaaaagatttcttotttc 

tgaaaaactcataaataattacgaaaacx^ccccaaaattgct^ 

caattaaaactggaaaagcacttaaaatcaataaa 

ATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGOT 

TCTTGAAGATTACCGTAAAAATACTAACAACATTC 

AAAGACTCTGATTCTTTACAATCTGCCTTTTAT 



f02A.aa BB002 
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LILVLIYKESILKKAKLJtfllKI^ 

DLGASDEKFMGTTASELKAIGKELEDRKNQYDIQIAK^ 

ENIETLKEILEKLINNYENDPKIAANFLYRIA^ 

EKFKKTLNKTLEDYRKNTNNIQENK^LAEHFNKYYKDS 

t02A.aa BBG02 

CAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKD^ 
AKITNEESNLLDTYIRAYELANENEKhlLLKRFLL 
QLKLEKHLKSINEKIOTLSKENSKEDLEALLEQVKSALQI^E^ 
KDSDSLQSAFY 

f03A.nt BB006 

TGATTTAATGTAAATTTTAATTACCGCCTAAAAAAGGCTTTAAATC^ 

GAACATATAAACATTTGGAACTAATAATGCTGCCCATC 

TGTACATCT^GACAGCAATACTGGCAAACCAATAAGC^ 

AAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATA 

ATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAA 

ATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAAT^ 

GTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTC 

GATATGTTACreCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 

TGGTCATTGGAATTATATGCTTGCAGATTT^ 

TCTCTTAATTCAAAATTAATTATTGAATTTTTA 

GAGATTTATTTGAAGATATATAA 

t03A.nt BB006 

TGCGCUTTTT T TAAGAAACCAC7ATCTGTACATCAA 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTG^ 

AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTA 

ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCX3TCATO 

GAGATGATGAATATGAAATTGAAGATGTTAAATTTC 

AGCTGTTGAAAATTCAGAAGAAGAAGGATATGTT 

AATGCTTTTAAATTAAGATATAAAAATGGTCAT^ 

CTCAAGAAACTAAAATTTATAAAATTTCTCTTAA 

AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATA 

£03A.aa BB006 

FNVNFNYRLKKALNGIKEEDLMVFRTYKHLELIMLP 

KLPI INSNHDVTWIKTKAMTI LGEDGKEI PEFKNKFGYS YI I SPVKMDGKYSYYASLL I LFETTKNGDDEYEI EDV 

KFVTAGSTLELKNSLLAVENSQEEGYWAYPFGILMS^ 

LNSKLIIEFLKEVLKENSILKDIAGDLFEDI 

t03A.aa BB006 

CAFFKKPQSVHQDSNTGKPISDEIGL*HLISGKIS^ 
ISFVKMIX3KYSYYASLLILFETTKNGDDEYEI 
NAFKLTYKNGHWNYMLADLTVKNKLTQETKIYKISLNSKLI I 

f04A.nt BB011 

TAATTACCAAAGATAAGTAAACTTGCA^TAAAACTACACGTATTGAAAG 

TATATAATGGCACTAAATATCTGAAAATGAAGGAGAAGCGGGTGGGCAATAA^ 

AATTTTAATAGTTGGTTGCGACTGGGGAACTATTAAAGATAAAAG 
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AAAGATAAGACTAAAAATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATC 

CTGATACGGGCATTACTAGTTTAGGAAGTCTAAACAACTTGGATTT 

ACCTATAATCTCAAATGAGAAAGCCATAGCTACTCAAGC^ 

ATAAACCCAAAACCAGCTCAAAATTTGGGAAATTCTTTAAAC^ 

CAATTGAAAACCAAGAGTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTO 

AACACAACACGAAAAAGAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGT 

GAAATTATTAAGTTTAAGGAAGAATATTACAAACTTTATAATTTC 

AAAGGAATTCATTTATAAAAGATACTAAATTTGGGGAAAATAGACAAAA 

ATCTATAGAGAAAGAAATTAGAGATTTGAATTATAAGTTGNGTGAAA 

AGCTGGAATAATGCAAACTCTCTTTTAAAAGAATCTATAGAAA^ 

ATGAGAGTAGAAAGCAAGGTCAAATTGGTGGACCTGCTAATAGATGGGA 

GGATGCAAAGTATAAGGCAGAACATTCAGCAAATGATTTGGAAAATGCAGCC^ 

AATGAAAAAGAAGCTAAAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATO 

t04A.nt BB011 

TGCGACTGGGGAACTATTAAAGATAAAAGTAC^GAAATTTC 
ATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAA 
TAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCX3TTCA 
GAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACAT^ 

ctcaaaatttgggaaattctttaaac^tactactactgaa 
gtggcttattagtaaaaagattttgcccagtaagt^ 
gaagcttttaagacggctaaaactatacaaagtctcato 
aggaagaatattacaaactttataatttgtt^ 

AAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAA 

ATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTT^ 

ACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCA^ 

AGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAA^ 

GCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATT^ 

AAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTAGCCTA 

f04A.aa BB011 

LPKISKXANKTTRIESRFEISIIFIYNGTKYLKMKEKRVGNKIFYISVVLILIVGCD^ 
DKTKNQDRIELGEDNFVSKNNMSTTD^ 

NPKPAQNLGNSLNNTTTEDSVKFLSIENQEWLISKKILPSKLENL 

I IKFKEEYYKLYNLFEGIQQKFHSQRNSFIKOTKFGENRQKNAVI FKSFSS IEKEIRDLNYKLXEIQSNFQIADVS 
WNNANSLLKES I EKLIQAI EKRYDNESRKQGQ IGGPANRWDKNQADNFAIODAKYKAEHS ANDLENAANYFRYSC SN 
EKEAKKLLEEIKKRFVRIGISL 

t04A.aa BB011 

CDWGTIKDKSTEISKLI^TDKDKT^ 

EKAIATQAKVDLMNNINVTIINPKPAQNLGN^ 

EAFKTAKTIQSLISNSNMGKEIIKFKEEYYKLYNL^^ 

IRDLNYKLXEIQSNFQIADVSWNNANSLLKES I EKLIQAIEKRYDNESRKQGQIGGPANRWDKNQADNFAKDAKYK 
AEHSANDLENAANYFRYSCSNEKEAKKLLEEIKKRFVRIGISL 

fOSA.nt BB009 

TAAATAAATTGTAGGATAAAAATGAAACAAAAATACGAAAACTATTTTAA 

TATTTTTACTACTAGCATGCTCAAGCGAATC 

CAATATTTTGGGCAGTTCAAGTCCAAGAGGAATTTCTC 

TTTAAAAAAGAAAACGGCAAGATTGAAAAAATTGATTTG^ 

TATCTGGAAAAACCTATCTTTTAGCGCAAAACAAAGAAGAAGAATTAG 

GACATTAAAATTTAAAAAACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGA 
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t05A.nt BB009 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCA^ 

CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTC^ 

CAAGATTGAAAAAATTGATTTGAGCAATTCT^ 

CITTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTT 

AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCG 

fOSA.aa BB009 

INCRIKMKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNLQKIKHEYNIIX3 
KKENGKIEKIDLSNSYEFINDIWISGKTYLIAQ^ 

tOSA.aa BB009 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGKIEKIDLSNSTO 
LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFIJ^SrVEEMA 

f06A.nt BB014 

TAAGGAGCATATATGAGGATTTTGGTTGGCGTTTGTA^ 
ATCAGGAACAAGCTGTTCAAACTTTTTTTG^ 

AGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTT^ 

TTAAAAGATCCTAATTATCNNGNTGTAGTACNC 

TAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTT^ 

ATTTATGCGTATAGTTCGTTGGCTGTAT^ 

GGGAGCCATGAGTATAATCGTAATATGCCTAGAC^ 

ATAATAGCCCAGTTTCTATTTTACCTACATAA 

t06A.nt BB014 

TGTTATTTGCCTGATAATCAGGAACAAGCTGT^ 

ATGAGATTGTTACTGAAGGCATATTTTCTAGTTTAA 

AAAGACTTTAATTAGTTTAAAAGATCCTAATC^ 

TTTAATAAATTCTTTCTAGATTTAGGGTCTGAGCAA 

AGCAGAACAATAATAAATTTATGCGTATAGTTCGTTGGCTC 

TAAGTATTCTGGCGAGGGGAGCCATGAGTATAATCGTAATATGCCTAGATC 

GTGAAGAGGTATGATTATAAT 

f06A.aa BB014 

GAYMRILVGVCIIALALIXXrniPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLK^ 

KDPNYXXVVXFVSDYNEEYFNKFFLDLGSEQSKDLIKLFIMVICNEQNNN^ KYSGEG 
SHEYNRNMPRPTAYEQYLKVKRYDYNS PVS I LPT 

t06A.aa BB014 

CYLPDNQEQAVQTFFENSES SDMGSDEIVTEGI FSSLKLYASEHRLLVEI KKTLI SLKDPNYXXVVXPVSDYNEEY 

FNKFFI^LGSEQSKDLIKLFIMVKNEQNNtrcFMRIW 

VKRYDYN 

f07A.nt BB023 
TAAAGTATTTTATTTTTTTTATTATCCAC 

GGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGA 



WO 98/59071 



PCT/US98/12718 



214 

TABLE 1 . Nucleotide and Amino Acid Sequences 

AAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTT 

AATAATTGGAGCGTTTTACTTACTCCT^ 

TTGTTAAGAGTGAGTGAAAAAGGTACGCAGGTGATACT 

ATCATCTGCTATGATTATGCCACCATTTAAAATTCCTT^ 

GGTCTTATTGATAACATTAAAACCATGAAAGAAA 

AGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTl^ 

TTTAATTTGGTCAAATCCTAACTATAT^ 

CTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTA 

TTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGT^ 

ATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAG 

AAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTC 

AAGAATCATCTCCGAAAAATTAG 

t07A.nt BB023 

GAGGGTTCTAAAAGGGCAGAGCCTGGAGAATTAGTT^ 

GACTTGATCITACAAATTATGTTGArrATGTATATO 

AGATCTTGGGATAAATAATTGGAGCGTTTTACTTACTCCOT 

GTTGCGCCCGCTGTTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTG^ 

GAAGCTATTCTCAATCATCTGCTATGATTATC 

ATTTTTAGGCAAAGGTOTTATTGATAACATTAAAACCA 

GAGATAGATCTTGAGGTTTTATTTGAAGATATGAATGNCATGGA^ 

AAGGGTGGGCTGATTTAATTTGGTCAAATCCTAACTATA 

TCCAAATTATCCTCTTGCTTCAAGTAAAAT^ 

CAAAATTTCATCTTTTATGTTAAAGATTTAAGAGTTC 

ACAGTGAGTCrGTATTTAAAGTTTATGAGACTAGCGGAACro 

TAAAAGNGTTTTAAAGCTTAGAGAAAAAATTTOTATGCCTGAAGO 

AGTGAAAAACCTGAAGAATCATCTCCGAAAAAT 

f07A.aa BB023 

SILFFLLSTVLFAQETIX5LAEGSKRAEPGELV 

NWSVIJ^TPSARLQAYVKNSWAPAVVKSESKRYAGDTILGVRVLFPSYSQS 
LII^KTMKEIKVSVYSLGYEIDLEVLFEDMNXMEYAXSMGTLKPKGWADLIW 
ASSKMRFKAFRVSKSHSSKEQNFIFYVKDLRVLYDK^^ 
LREKISMPEGSFQNFVEKIESEKPEESSPKN 

t07A.aa BB023 

EGSKRAEPGELVLDFAELARDPSSTRLDLTNYVDYVYSGASGIVKPEDMW 

VAPAWKSESKRYAGDTI LGTOVLFPS YSQSSAMIMPPFKIPFYSGESGNQFLGKGLIDNI KTMKEIKVSVYSI^ 
EIDLEVLFEDMNXMEYAXSMGTLKFKGWADLIWSNPNYI PNI SSRI IKDDVPNYPLAS SKMRFKAFRVSKSHSSKE 
QNFIFYVKDLRVLYDKLSVSIDSDIDSESVFKVYOT 
SEKPEESSPKN 

f08A.nt BB024 

TGAATATTAATAATAAAAAAAGGAGTAACAATGAAAATCATCAACATATO 

ACXSGCTGTAATTCT/^TGATAATGACACITTAAAAA 

AACCCAAAAAGAAACAACACAAGAAAAACCAAAATCTAAAGAAGA^ 

AAAACACATCTTGACTGGTTAAAACCCGCTTTAACTGGTGCTGGA 

ATAAAATAAAATCAGCACTTGATCATATAAAAACTCAACTTGATA^ 

AACCACTTTCAAAACTGTGGTTACAGAATTCTTO 

AACTGCAATAATGGTGGCTAA 



t08A.nt BB024 
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TGTAATTCTAATGATAATGACACTTTAAAAAACAAT^ 

AAAAAGAAACAACACAAGAAAAACCAAAATCTAAAGAAGAACTACT^^ 

ACATCTTGACTGGTTAAAACCCGCTTTAACTGGTC^ 

ATAAAATCAGCACTTGATCATATAAAAACTCAACTT^ 

CTTTCTVAAACTGTGGTTACAGAATTCT^ 

CAATAATGGTGGC 

f08A.aa BB024 

ILIIKKGVTMKIINILFCLFIA^ 

THIJ3WLKPALTGAGEFDKFLENDDDK^ 

CNNGG 

t08A.aa BB024 

CNSNDNDTLKNNAQQTKRRGKRDL5 1TQKETTQEKPKSKEELLREKLSDDQKTHIJ)WLKPALTGAGEFD 
DKIKSALDHIKTQLDSOTGIX^AEQQKTTFKTVVTEFFKNGDIDNFATGAVSNCNNGG 

f09A.nt BB025 

TGAATATTAATAATAAAAAAAGGAATAATAATGAAAATTATC 
ACGGCTGTAATTCTAATGATACTAATAATAGCCAAA 

AACACAAGAAAAACCTAAATCTAAAGAAGAACTTCTTAGAGAAAAGCTAAATGATAATC 
TGGTTAAAAGAAGCTCTGGGCAATGATGGAGAATTTAATAAAT^ 
CACTTGATCATATAAAGAGTGAACTTGACAGTTGTACTGGA 
CGTTCAGGAGGCCCTTAAAGGGGGCATAGACGGCTTTGAAAATACTC 

t09A.nt BB025 

tgtaattctaatgatactaataatagccaaacaaaa^ 

aagaaaaacctaaatctaaagaagaacttcttagagaaaagctaaatgataatcaa 

aaaagaagctctgggcaatgatggagaatttaataaattt^ 

gatcatataaagagtgaacttgacagttgtactggagataaggtt^ 

aggaggcc^ttaaagggggcatagacggctttgaaaatactgcaag™ 

f09A.aa BB025 

ILIIKKGIIMKIINII^LFLLMLNGCNSNOTNNSQTKSRQKRD 
LKEALGNIX3EFNKFIX3YDESKIKSALDHIK^ 

t09A.aa BB025 

CNSNDTNNSQTKSRQKRDLTQKEA5 1TQEKPKSKEELLREKLNDNQKTHLDWLKEALGNTC 

ALDHIKSEI^SCTGDKVENKNTFKQWQEALKGGIDGFENTASSTCKNS 

1 
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(AE001031) chemotaxis response regulator (cheY) [Archaeoglobus 
cheY gene product [Listeria monocytogenes] 


Ynel [Bacillus subtilis] >gil870926 response regulator 


ORF2 [Leptospira borgpetersenii] >splP24086IYLB3_LEPIN 
HYPOTHETICAL 


orfX; putative OrfX protein [Borrelia burgdorferi] | 


cheY gene product [Halobacterium salinarium] >pirlS58645IS58645 CheY 


spoOF [Bacillus subtilis] >gil 143601 SpoOF protein [Bacillus 


(AEQO 1 167) conserved hypothetical protein [Borrelia burgdorferi] | 


orf A [Borrelia burgdorferi] 
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LE 2, Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent d 
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(AF026270) PduF [Salmonella typhimurium] >splP37451IPDUF_SALTY | 


glycerol dfiffusion facilitator [Pseudomonas aeruginosa] j 


(AE001005) glycerol uptake facilitator, MIP channel (glpF) | 


(AEOOl 1 34) B . burgdorferi predicted coding region BB0238 [Borrelia ] 
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KAE001 132) gufA protein [Borrelia burgdorferi] >pirlC70127IC70127 j 


coded for by C elegans cDNA CEESS55F; coded for by C. elegans 
cDNA 


KAE000831) conserved protein [Methanobacterium thermoautotrophicum] | 


gufA gene product [Myxococcus xanthus] >gil49253 orfX gene 


(AE000757) hypothetical protein [Aquifex aeolicus] 
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TABLE 3. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 
Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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TABLE 4. Residues Comprising Epito-Bearing Fragments 



Query 


Residues Comprising Epito-Bearing Fragments 


flOl.aa 


rrom about JLys-oz to about Lriy-o4, irom about ber-1 1 1 to about 
Asp-1 13, from about Arg-136 to about Arg-139, from about Pro- 

1 QO Irx nkrvnt A on 1 OQ 

loy to about Asn-iyj. 


fll.aa 


from about Pro-38 to about Lys-40, from about Glu-92 to about 
Lys-yo. 


ilz.aa 


rrom about rro-zoo to about Asp-zW, trom about Asnoio to 
about Gly-338, from about Tyr-410 to about Gly-413, from about 
/\sp-*fio to aooui /\rg-*t-zu, iruni auoui rru-jjz iu oooui v<u-jj*i, 
from 





about Gln-643 to about Asp-645, from about Gin- 1061 to about 
ATg-iuoj, rrom aooui /\sn-i uu to aooui i^ys-i i oz. 


fl29.aa 


from about GIu-76 to about Arg-81, from about Lys-144 to about 


fl47.aa 


from about Gln-94 to about Thr-96. | 


^1 CO ~ ™ 1 

fl52.aa 


trom about (jly-3D to about Oiy-3 /, rrom about Lnnozl to about 

Lriy- dL j . 


fl54.aa 


from about Asn-39 to about Lys-41, from about Ser-74 to about 
uys-i /, iTom aooui oer-zij 10 aooui oiy-zi rrom aooui oci-jUj 

tn nltriiit A ctv-^O^i from nrtont A ( rr\-A0 r ) tf\ nrv\nt Acn -4'? A 
LU uUUUl /AoJJ-*?V/U, llUlil alA/ul /Ao^r - +ZrZ» IAJ aUvlUl xYo11^tx>*t. 


fl57.aa 


from about Lys-21 to about Asp-24, from about Ser-45 to about 

Tvr-47 


7l7.aa 


from about Arg-17 to about Asn-20, from about Thr-94 to about 
vjiy-yo. 


iloO.aa 


irorn aooui juysouj 10 dooui 1 yrouo. 


nyo.aa 


irum aooui i_*ys-izi 10 dDoui /\in-izj, iruiii duuui inrcr-z/o lu ouuui 
Lys-282, from about Glu-576 to about Tyr-578. 


f899.aa 


from about Asn-174 to about Asp- 177. 


f925.aa 


from about Lys-201 to about Asp-204, from about Phe-291 to 
about juys-zy4. 


ryzy.aa 


rrom aooui JrTO-uy 10 aooui /\sn- i*f i , irom aooui /\rg-zi i 10 <touui 


f933.aa 


from about Ser- 1 39 to about Lys- 143. 1 




■Prvxm q1v\ii1 C^\\r 1 tf\ o^\/\tit > Acn 1/18 

rrom aooui oiy lhd 10 aooui /\sn-i*fo. 


f943.aa 


from about Asp-58 to about Asp-60, from about Lys- 157 to about 
Asn-159, from about Asp-217 to about Asp-221, from about Lys- 

ocri * n oKrtnt Acn-9S4 frnrri aKmit Prn-9 f\7 tc% aKniit A Qn-^^vd frnm 

£+D\J Wj aLFUUl /\oii -Xr-J^T, XX Villi OXJSJXJX ilU"ZrUx» WJ dL/V/UL X> U*t , li villi 

about Gly-305 to about Trp-307. 


l7.JZ.aa 


llv/lll dUUUL OCl IVI OUvlllL OC1 


f4.aa 


from about Arg-64 to about Arg-67. 




rrom aooui ocr-o*f 10 aooui um-o / , rrom aooui /\sp-z j 1 10 aooui 
Tyr-233, from about Arg-296 to about Asp-300. 


f50.aa 


from about GIu-136 to about Gly-138, from about Asp-153 to 
ahnnt T v<i-1 5^ from about AstvTJIQ to aHniit A°.n-2Q1 from about 
Glu-458 to about Asn-461. 


f65.aa 


from about Glu-120 to about Asp- 122, from about Pro-204 to about 
Tyr-206. 


fS.aa 


from about Pro-263 to about Arg-265, from about Asp-274 to about 
Lys-278. 


f82.aa 


from about Tyr-66 to about Gly-68, from about Ser- 1 16 to about 
Lys-1 19, from about Asp-121 to about Gly-123, from about Pro 
128 to about Gly-131. 
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ioo.aa 


i rum doom aymi-i /y io oouui ami- i o i , ixuiii oXajul uybr wyz* iu 
about Asn-194, from about Lys-270 to about Asn-272, from about 

T VQ-97Q It* nVimit T vc-9R9 frnm ahmit tn ahnnt Asn-333 

L^iJO-jL 1 y IXJ ttUUUl l_<jr i i,OL, 11 villi dLnJUL Ao[rJJ 1 t\S alA/Ul imOll -J -J -J. 


f477.aa 


from about Pro-250 to about Lys-253. 


i4oo.aa 


irom aooui Juys-/o io aooui uys-/y> irom aoout /\sn-*H>o 10 aooui 
Asp-489, from about Lys-508 to about Gly-510, from about Asn- 

jj7 iv oouui vxiy**ju^. 


f494.aa 


from about Lys-76 to about Asn-78. 


tMo.aa 


irom aoout JLys-^z to aooui Asp- 


f523.aa 


from about Pro-202 to about Asn-206, from about Lys-255 to about 
lyr-Zjo. 


iDZo.aa 


irom aooui /\sn-o-> io aooui l^ys-oo, irom aooui /vsp-ijo io aooui 
vjiy-ioo. 


f577.aa 


from about Cys-18 to about Lys-22, from about Asn-297 to about 
Gln-300. 


i3o4.aa 


irom aoout rro-i to aoout Lys-iDD, rrorn aooui rro-zuu io aooui 
ocr-zi/^. 


f596.aa ; 


from about Arg-42 to about Asp-44, from about Asp-1 17 to about 
Tyr-1 19, from about Pro-205 to about Asp-207. 


toOU.aa 


from about Pro- 143 to about Asp- 145. 


f603.aa 


from about Phe-35 to about Ser-37. 


f607.aa 


from about Gln-67 to about Lys-70, from about Asp-273 to about 
lyr-275, irom aoout Asp-iJJ to about uiyoJo, irom aoout rro- 
359 to about Lys-362, from about Arg-409 to about Gly-41 L 


ioll.aa 


irom about Arg-133 to about Cjly-liD. 


f631.aa 


from about Pro- 132 to about Asn-136, from about Asn-159 to 
about Tyr-161, from about Pro-216 to about Asp-218, from about 
rro-zzu to aoout L»y S-ZZ3. 


f688.aa 


from about Lys-266 to about Asp-268, from about Lys-27 1 to 
aooui /\sn-z/ irom aooui i^ys-ji j io aooui i-,ysoio. 


f704.aa 


from about Lys-250 to about Lys-253. 


f707.aa 


from about Lys-131 to about Asp- 134, from about Asp-246 to 
about Asn-249. 


i/09.aa 


irom about 1 yroy to about vjiy-4Z, irom aoout L*ys-i4o to aoout 
vjiy- 1 ju, irom aooui /vrg-zoy io aooui oiy z / iruni aooui oci 
/Lf%f\ fo 5iKr»iit Tvr-AAR frnm sthrmt A cn-45lQ tn a hoi it A°.n-4Q1 frorn 

*Tvl\J WJ aUKJUV 1 Jf I^TVIO, UUiU (UA/Ul /^all^TO^ IAJ CUAiUl Aoll -, t71j 11V/1U 




oKniit T vq-S7S tn ahnnt A<!Ti-S7R from aKnnt Pm-n49 to ahniit 1 .v<i- 

OlAJUL MS J / J 1AJ dUUUl AdjTJ / O, 11 Villi OLrvlUI. i 1V^VTTZ» lAJ <HJ\J\*l M^tjoT 

644. 


fl97.aa 


from about Pro-217 to about Asp-219, from about Glu-675 to about 
Asp-678, from about Pro-687 to about Asn-689, from about Glu- 
694 to about Gln-696. 


f200.aa 


from about Arg-174 to about Phe-179. 


f208.aa 


from about Arg-326 to about Ser-328. 


f210.aa 


from about Pro-191 to about Ile-194. 


f221.aa 


from about Asn-133 to about Asn-135. 


£253.aa 


from about Arg-191 to about Gly-194. 


f269.aa 


from about Ser-271 to about Thr-273, from about Asp-284 to about 
Gly-286. 


f29.aa 


from about Pro- 159 to about Ser-161. 


£290.aa 


from about Pro-240 to about Gly-244. 


f291.aa 


from about Gln-267 to about Lys-269. 



^ 
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f296.aa 


from about GIu-98 to about Lys-101. j 


G.aa 


from about Asn-241 to about Lys-245. 


f30.aa 


ficom about Asn-156 to about Tyr-159, from about Asn-178 to 
about Lys- 180. 


f939.aa 


from about Ser-245 to about Asn-249. 1 


f739.aa 


from about Asn-80 to about Tyr-82, from about Lys-208 to about 
Ser-210. 


f742.aa 


from about Ser-141 to about Asp- 145, from about Asn-222 to about 
Gln-225, from about Asp-243 to about Tyr-247, from about Asn- 
249 to about Asn-25 1 


f743.aa 


from about Arg-11 1 to about Gly-1 14, from about Pro-131 to about 
Asd-134 


f790.aa 


from about Thr-40 to about Asn-42, from about Ser-53 to about 

from about Lvs-215 to about Aso-218 from about Asn-274 
to about Gly-277. 




from ahont Val-87 tn ahnnt 5ser-R4 from about Ser-102 to about 

11 Villi CUAS 11 1 V Ul IXJ (UA/Ul tTT , UvUI OUvUL Ovsl X \JJJ lu auuui 

Asn-104, from about Gln-127 to about Tyr-130, from about Lys- 
309 to about Asn-3 14, from about Lys-375 to about Thr-377, from 




about Pro-51 1 to about His-513 from about Thr-515 to about Aso- 
517. 


f7Q7 aa 


from about Pro-1 19 to about Glv-122. from about Lvs-166 to about 
Asn-169. 


f799.aa 


from about Asn-3 1 to about Asn-34, from about Gln-44 to about 
Asn-47, from about Pro- 123 to about Gly-125. 


f814.aa 


from about Ser-120 to about Ser-122, from about Arg-636 to about 
Asn-638, from about Cys-967 to about Ser-969. 


f820 aa 


from about Thr-563 to about Tyr-565. 


f850 aa 


from about Tyr-159 to about Tyr-164, from about Gln-375 to about 
Asp-379. 


f853 aa 


from about Thr-180 to about Lys- 184, from about Arg-231 to about 
Asp-233, from about Asn-252 to about Gly-254. 


f859.aa 


from about Lys-46 to about Ser-52, from about Pro-88 to about 
Asn-91, from about Asn-1 17 to about Asp- 120. 


f861.aa 


from about Asp-38 to about Lys-40, from about Lys-219 to about 
Asn-225. 


f368.aa 


from about Gln-228 to about Asn-231. ! 


f371.aa 


from about Tyr-109 to about Asn-1 11, from about Asn-1 62 to 
about Gln-164. 




frnm abnnt Acn-1 \ R tn abnnt T 79 frnm about Ser-26Q to about 

IXVIlil OUUUL Adli*l 1 0 Iks dUvJUt LY5 lirXi, llvlll aUUUl Ot>l 4+\Js WJ auuui 

Gly-271, from about Lys-370 to about Asp-373, from about Asn- 
509 to about Lvs-51 1 from about Lvs-705 to about Are-707. from 




about Thr-912 to about Glv-914 from about Pro-1213 to about 
Asp-1216, from about Asn-1491 to about Arg-1493. 


f527.aa 


from about Gys-20 to about Gln-22, from about Asn-3 8 to about 
Asn-40, from about Phe-1 12 to about Asp-1 14, from about Lys- 160 
to about Asn-162, from about Ser-199 to about Asp-201, from 
about 




Gln-258 to about Gly-261, from about Arg-282 to about Asn-284, 
from about Ser-297 to about Asp-299. 


f541.aa 


from about Ser-68 to about Asn-71. j 


f604.aa 


from about Lys-77 to about Gly-79, from about Lys-201 to about 
Asn-203, from about Asp-252 to about Asp-254, from about Tyr- 
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^47 tn ahnnt frlv-^Sft fmm ahnnt Acn-^14 tn ahnnt Tm-Slfi 
J*t/ IU aUUUL tJiy-JJU, U U 111 aUUUL -rVap-JJ.** LU aUUUl 1ILJ-JXU. 


1 / JO. oil 


fVinTYi ohniit T \7c_9n tn ohrait A cn_OA fmm o limit A i-rr_ 1 ATI tn ahnnt 
1IU1I1 aUUUl i-/jrl> ZV/ LU aUUUl /\a*l _ Z/+, ilUili aOUUL /\lg-l*t/ IU alXJUl 

Ser-153, from about Ser-231 to about Lys-233. 


i / jz.aa 


fmm dhmit TVir_1 1 Q tr% ahnnt T vc_1 01 frr\m oKnut Pm-AOO tn a Km it 

iruni oouul i ill- 1 ly lo aouui i-»ys xzz, rrom aooui llo-h^aj iu auvm 
Glv-422 


f798.aa 


from about Asp-33 to about Thr-36, from about Lys-1 80 to about 
His- 183 


ft&5.aa 


from about Pro- 100 to about Asn-102, from about Asp- 145 to 
about Phe-147 

UUWUl X XXt> x~ / • 


f32.aa 


from about Lys-1 8 to about Asn-20. 




fmm ahnnt Acn 10^ tn ahnnt T pmi 1 QK fmm ahnnt CUnJJAA tn 
Hum aDOUl /\SI1 17J lO aOUUL 1X11-17-), lrOLLl aDOUl VJlli Z*fO IU i 

about Lys-250. 


pica QQ 


IIUIU aDOUl OCr-^U lO aDUUl /\i>D-*f y. 


f30Laa 


from about Lys-178 to about Lys-180, from about Ser-401 to about 
i y r-*fu*f. 


fTtt a"a 

ij / j.aa 


irum ououi oiy-oo to auum i^ys-^u, irurn auouc /vsn- d jy 10 aooui 

T fmm ahnnt G1ii-n^a> tn ahnnt ^f»r-nS7 
JL»yj> J*rZ, 11UI11 oUUUl VJ1U - UJ*t LU OUUUL OCI UJ / . 


f384.aa 


from about Pro-250 to about Asn-252, from about Asp-266 to 

ahnnt T vq-9^R 


f446.aa 


from about Asp-20 to about Ser-26, from about Asn-146 to about 
T vq.14.0 


f^A9 aa 


frnin ahnnt Afa_ftn tn ahnnt frit/— 51ft fmm ahnnt A t* 1 tn ahnnt 
11 UW aUUUl AVlg OvJ LU dLAJUL VJiy-OO, IIUIIL aUUUL r\Ig-lUJ LU aUUUl 

Asn-165 


f*)3 aa 


from about Asn-152 to about Asd-155 


fl05 aa 


from about to about Phe-50 

XXVSXX1 UlA/Ul /jLOf^ i LU uUUUl A Alls — /V/» 


fl50 aa 


from about Thr-214 to about Aso-218 from about Asd-256 to 
about Asp-259. 


f219.aa 


from about Asn-77 to about Asn-81 , from about Asp-1 1 1 to about 
Asn-115. 


f229.aa 


from about Gln-61 to about Asn-63. 


f32.aa 


from about Lys-1 8 to about Asn-20. 


flRfiaa 


from about 1 vs-305 to about Tvr-308 

XX V? XXX Oi/ViXX XwJTk> m*\Jm* %\M IllA/Ut X Jr X •JV/U* 


£216.aa 


from about Ser-105 to about Asn-107. 


f328.aa 


from about Asn-105 to about Asp- 107. 


ijjz.aa 


rrom aooui »>er- £ K) xo aooui /\sn-«*y. 


f867.aa 


from about Thr-3 to about Gly-5, from about Lys-1 56 to about Ser- 
159. 


tooo.aa 


rrom about Arg-y4 to aooui oiy-yo, nom aoout ito-zj / to aoout i 
oiy-zoi, rrom aooui rro-zyj 10 aooui /\sp-zv /, rrom aooui /vrg- 
340 to about Asp-342. ! 


lo/z.aa 


rrom aooui oer- 1 y 10 aooui L.ys-zj, rrom aooui i nr- 1 yy 10 aooui 

Acn- 14.9 frnm aVmut S»pr-9J19 tn ahnnt Tvr-TKfi frntn ahnnt *2f»r-^1 1 
Aoj/^i^, iiuixi ouvui oci z»Oij iu cuiuul x yi-^ou, XlUXXi duUUl OC1 J 1 1 

to about Ser-313. 


f886.aa 


from about Thr-83 to about Asp-85, from about Asp- 106 to about 
Lys-108, from about Lys-143 to about Gly-147, from about Asp- 
186 to about Asn-191. 


f888.aa 


from about Asn-65 to about Asp-67. 


f893.aa 


from about Asn-203 to about Asn-207, from about Thr-446 to 
about Asn-450. 


f605.aa 


from about Arg-31 to about Asp-33. 


f606.aa 


from about Asn-68 to about Gly-71, from about Asn-136 to about 
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uyh-kDzfy irom aooui /\sn-zzj> to aooui i yr-zzo, iTorn aouui oer- 
276 to about Tyr-279, from about Pro-362 to about Asn-365, from 




aooui /vrgouj to aoout irpou/. 


fA70 a a 
IO /y.aa 


rrom aooui L.ys-i3*f 10 aooui /vsp-i do, rrom aooui Lrys-zz^f 10 
about Arg-226, from about Asn-260 to about Asp-264, from about 

f~w}ii„'\(\'\ fn a Km it T vc.^/tA frnm o Vim it A cm "5 ft 7 tn aKnnt OK/ ^RQ 
VJIU-JOJ IO aOOUl JUySOOO, XXUIU dOOUl /\5pOo / IO dDOUl vJiy- J07, 

frnm 




about Tyr-441 to about Lys-443, Irom about Arg-501 to about Tyr- 
504 


fl 1-12.33 


from about Pro-91 to about Asn-93, from about Pro-181 to about 

A cr\_1 Q< frnm oKniit T vq-OAA tn siKnnt Q/»r_9A5l 
Aap-lOU, 11 Ulll aUUUl L/jfyZfT*! LU dUUUl OCI-ZHO. 


f 11-4.33 


from about Asn-160 to about Lys-163. 


Il*Ho,aa 


irom aooui rro-yz 10 aooui oin-yj, nom aooui juys-iz.} 10 aooui 

Thr- 1 9S frnm nhniit T v«-9 1 S tn nHrait A crw9 1 0 

llU-l^J) 11 Ulll aUUUL l*rjro~Zl«/ LU OUUUl Aop^Xly. 


fl7-6.33 


from about Pro-36 to about Glu-38. 




irom aooui oer-iiw- to aoout oer-iuo, trom aoout oin-zJU to aoout 
Asn-232. 


"f1Q_il QQ ! 


nom aooui vai-/v to aoout i nr-oz, nom aoout jrro-130 to aooui 
Gly-201. 


f 1 0-n a* 


iiuiii duuuL /vj>p-z*+ to ouuui j^ysou, i rum aooui rrooo 10 aooui 
Glu-38. 


•pi 1 _A aa 
IZ 1 -H.aa 


rrom aooui v^ys-z*f 10 aoout /\sn-zo. 


1ZO Z.dil 


fiy>m aVv\nt Cor 99 trt ahniit T \#c ftfl Aytm aKnnt T'xrr 9*7/1 tr» aKnut 

rrom aooui oer- / / to aoout i^ys-ou, rrom aooui r yr-z/*t 10 aoout 
Asn-277. 




iruiii dooui vjiu-jj> to aoout /vrg-3 /, rrom aoout oin-oz to aooui 

A^n-ftS frnm ahnut rwln-l S7 tn ahnnt Acn-1 SO 

■fVoll 11 villi dUv/Ul VJ111U 1 WJ Ol/V/liL AjIIIJ/i 


f31-2.aa 


from about Arg-95 to about Arg-97, from about Asn-297 to about 

A^n-900 


f4-15 aa 


frnm ahniit t*rn-1 89 tn aKniit Acn-1 frnm aHnnt T vc-990 tn aHnnt 
11 will aUSJUL nirlO^ LU (ua/UL Ao|rl(rr, 11U111 aUUUL o tU aUUUl 

Asp-222. 


f4-S0.33 


from about Thr- 109 to about Asn-1 11. 


fA9-1 a a 
1 .da 


rrom aooui /\sn- d d to aoout /\sno / , rrom aooui /\rg-o i to aooui 
Ser-84, from about Asp-94 to about Asn-97. 




from oKrviit A cn_8^ tr\ oKrait (~Z\\/-$lf\ 
11U111 dUUUL /Vall-O J LU aUUUl VJiy-OU. 


f47-2 aa 

1*t / Xf.aa 


frnm siHnut ^Af— 9Q tn nHrait Acrw^^ frnm ahniit Acn-QA tn aHnnt 
11U111 auUUl kjCI Z7 LU aUUUl Asp-JJ, 11 Ulll aUUUL Aoll 7*t LU aUUUL 

Lys-99, from about Pro-152 to about Lys-157. 


f49-2 aa 


frnm JiHniit Acn-^iS9 tn siVtmit frlv-ril^l 

11 Ulll OUUUl /ull"*rJX LU OlA/Ul \Jljf-*rJ*T. 


f5-14.33 


from about Glu-102 to about Asp- 106, from about Thr-272 to about 
Asn-275, from about Glu-313 to about Asn-315, from about Ser- 

^70 tn flhnnt 5Ji*r-^79 


f5-15 aa 1 


frnm ahniit T 70 tn ahnnt Olv-1 7^ frnm ahnnt Acn.1 04 tn 

Ai«-mi dxjyjxxx. J-#jr o 1 t\J tu auuui \jijr i / -7> 11 Ulll tfUUUL /Aoll 1 Z7*"T iu 

about Gly-196. 


151-2 aa 


frnm flhnnt A«n-^09 tn aHniit T vq-^04 

11 Ulll CliAJUL ru^TJUL LU OUUUl i^tjo Jlrt. 


f6-2Laa 


from about Glu-38 to about Asn-42. from about Are-84 to about 
Gly-87. 


f6-27.aa 


from about Asp-67 to about Asn-69, from about Arg-85 to about 
Asn-89, from about Lys-168 to about Gly-171, from about Lys-179 
to about Asn-181, from about Ser-380 to about His-382. 


f6-5.aa 


from about Ser-67 to about Asn-7L 


f7-30.aa 


from about Pro-94 to about Asp-96, from about Lys-144 to about 
Arg-147. 


f76-Laa 


from about Asn-30 to about Lys-35, from about Lys-1 13 to about \ 



1 
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Gly-1 16, from about Glu-1 19 to about Lys-121 . 


f8-10.aa 


from about Pro-25 to about Lys-32, from about Ser-168 to about 
Thr-172. 


f01a.aabb001 


from about Pro-123 to about Asp-125, from about Ser-179 to about 
Asp-181, from about Lys-255 to about Gly-259. 


J>b0011 


from about Ala8 about Arg 17, from about Tyr3 1 to about Gly40, 
from about Ser65 to about Lys78, from about Val93 to about 
Asp 102, from about Serl20 to about Ilel29, from about Prol56 to 
about Glul70, from about Lysl87 to about Asn 196, from about 
His205 to about Lys214, from about Gly226 to about Glu235, fro 
about Gln253 to about Asn266, from about Glu283 to about 
Glu293, from about Leu31 1 to about De320, from about Arg326 to 
about Gly335, from about Pro340 to about Ala349 


f02aaa_bb002 


from about Tyr-169 to about Asn-171, from about Tyr-242 to about 
Asn-245, from about Lys-264 to about Asp-267. 


_bb9 


from about Met7 to about Lysl6, from about Lys47 to about Ser57, 
from about Asn80 to about Ser89, from about Gly 103 to about 
Glul 13, from about Lysl25 to about Prol33, from about Lysl38 
to about Alal47 i 


fO3a.aa_bt>006 


from about Asp-54 to about Thr-57, from about Lys-201 to about 
His-204. 


J>b014 


from about Pro23 to about Gln31, from about Ser37 to about 
Asp45, from about Leu76 to about Asn84, from about Leu76 to 
about Val84, from about Ser89 to about Asn97, from about Serl05 
to about Lysl 13, from about Asnl20 to about Metl28, from about 
Asnl59 to about Gly 167, from about Lysl73 to about Ball81 


_bt>023 


from about Aspl7 to about Gly27, from about Arg40 to about 
Asp48, from about Val64 to about Asp72, from about GlulOS to 
about Thrl 13, from about Serl41 to about Gly 150, from about 
Asp 155 to about lleloJ, from about Asnlo4 to about L,yslyo, trom 
about Ile219 to about Pro227, from about Ser230 to about Phe238, 
from about Sei241 to about Asu250, from about Asp270 to about 
Val278, from about Ser285 to about Leu293, from about Glyu307 
to about Ser315, from about Lys327 to about Asn335 


fD8a.aa_bb024 


from about Asn-30 to about Asp-33, from about Ser-1 16 to about 
Asn-1 18, from about Asn- 154 to about Gly- 156. 


fl>9a.aaj>b025 


from about Asn-30 to about Ser-35, from about Thr-145 to about 
Asn-148. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 8 ,line 8 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table l ;or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary to 
any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide having a nucleotide sequence identical 
to a nucleotide sequence in (a) or (b) of claim 1. 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which encodes an 
epitope-bearing portion of a polypeptide in (a) of claim 1 . 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion of a 
polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic acid 
molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 
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(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1 ; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-terminal residue; 

(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

1 1. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to a 
sequence selected from the group consisting of an amino acid sequence of any one of the 
polypeptides in Table 1 . 

13. An isolated polypeptide antigen comprising an amino acid sequence of an B. burgdorferi 
epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide sequence 
encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

1 6. A vaccine, comprising: 

(1) one or more B. burgdorferi polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutical^ acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective antibodies 
in an animal to a member of the Borrelia genus. 

1 7. A method of preventing or attenuating an infection caused by a member of the Borrelia genus 
in an animal, comprising administering to said animal a polypeptide of claim 9, wherein said 
polypeptide is administered in an amount effective to prevent or attenuate said infection. 

18. A method of detecting Borrelia nucleic adds in a biological sample comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1 , under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Borrelia nucleic acid 
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sequences present in the biological sample. 

19. A method of detecting Borrelia nucleic acids in a biological sample obtained from an animal, 
comprising: 

(a) amplifying one or more Borrelia nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Borrelia nucleic acid 

20. A kit for detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

2 1 . A method of detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 
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